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PATENT 

ATTORNEY DOCKET NO. 06132/041002 

IDENTIFICATION OF POLYNUCLEOTIDES ENCODING NOVEL 
HELICOBACTER POLYPEPTIDES IN THE HELICOBACTER GENOME 
5 Priority Information 

This application is a continuation of, and claims priority from, U.S. Serial No. 
08/833,457, filed on April 1, 1997, which is incorporated by reference herein in its 
entirety. 

Field of the Invention 
1 0 The invention relates to Helicobacter antigens and corresponding 

polynucleotide molecules that can be used in methods to prevent or treat Helicobacter 
infection in mammals, such as humans. 



Background of the Invention 
Helicobacter is a genus of spiral, gram-negative bacteria that colonize the 
15 gastrointestinal tracts of mammals. Several species colonize the stomach, most notably 
H. pylori, H. heilmanii, H. felis, and H, mustelae. Although H. pylori is the species most 
commonly associated with human infection, H, heilmanii and H.felis have also been 
isolated from humans, but at lower frequencies than H. pylori. Helicobacter infects over 
50% of adult populations in developed countries and nearly 100% in developing countries 
2 0 and some Pacific rim countries, making it one of the most prevalent infections 
worldwide. 

Helicobacter is routinely recovered from gastric biopsies of humans with 
histological evidence of gastritis and peptic ulceration. Indeed, H, pylori is now 
recognized as an important pathogen of humans, in that the chronic gastritis it causes is a 
2 5 risk factor for the development of peptic ulcer diseases and gastric carcinoma. It is thus 




highly desirable to develop safe and effective vaccines for preventing and treating 
Helicobacter infection. 

A number of Helicobacter antigens have been characterized or isolated. These 
include urease, which is composed of two structural subunits of approximately 30 and 
5 67 kDa (Hu et aL, Infect. Immun. 58:992, 1990; Dunn et aL, J. Biol. Chem. 265:9464, 
1990; Evans et aL, Microbial Pathogenesis 10:15, 1991; Labigne et aL, J. Bact., 
173:1920, 1991); the 87 kDa vacuolar cytotoxin (VacA) (Cover et aL, J. Biol. Chem. 
267:10570, 1992; Phadnis et aL, Infect. Immun. 62:1557, 1994; WO 93/18150); a 128 
kDa immunodominant antigen associated with the cytotoxin (CagA, also called TagA; 

10 WO 93/18150; U.S. Patent No. 5,403,924); 13 and 58 kDa heat shock proteins HspA and 
HspB (Suerbaum et aL, Mol. Microbiol. 14:959, 1994; WO 93/18150); a 54 kDa catalase 
(Hazell et aL, J. Gen. Microbiol. 137:57, 1991); a 15 kDa histidine-rich protein (Hpn) 
(Gilbert et aL, Infect. Immun. 63:2682, 1995); a 20 kDa membrane-associated lipoprotein 
(Kostrcynska et aL, J. Bact. 176:5938, 1994); a 30 kDa outer membrane protein (Bolin et 

15 aL, J. Clin. Microbiol. 33:381, 1995); a lactoferrin receptor (FR 2,724,936); and several 
porins, designated Hop A, HopB, HopC, HopD, and HopE, which have molecular weights 
of 48-67 kDa (Exner et aL, Infect. Immun. 63:1567, 1995; Doig et aL, J. Bact. 177:5447, 
1995). Some of these proteins have been proposed as potential vaccine antigens. In 
particular, urease is believed to be a vaccine candidate (WO 94/9823; WO 95/22987; WO 

2 0 95/3824; Michetti et aL, Gastroenterology 107:1002, 1994). Nevertheless, it is thought 
that several antigens may ultimately be necessary in a vaccine. 

Summary of the Invention 
The invention provides polynucleotide molecules that encode Helicobacter 
polypeptides, designated GHPO 35 (SEQ ID NO:2), GHPO 55 (SEQ ID NO:4), GHPO 
2 5 78 (SEQ ID NO:6), GHPO 89 (SEQ ID NO:8), GHPO 129 (SEQ ID NO: 10), GHPO 541 
(SEQ ID NO: 12), GHPO 607 (SEQ ID NO: 14), GHPO 635 (SEQ ID NO: 16), GHPO 701 
(SEQ ID NO: 18), GHPO 712 (SEQ ID NO:20), GHPO 761 (SEQ ID NO:22), GHPO 838 

- 2 - 




(SEQ ID NO:24), GHPO 1034 (SEQ ID NO:26), GHPO 1085 (SEQ ID NO:28), GHPO 
1213 (SEQ ID NO:30), GHPO 1255 (SEQ ID NO:32), GHPO 1308 (SEQ ID NO:34), 
GHPO 1389 (SEQ ID NO:36), GHPO 1706 (SEQ ID NO:38), GHPO 234 (SEQ ID 
NO:40), GHPO 314 (SEQ ID NO:42), GHPO 510 (SEQ ID NO:44), GHPO 603 (SEQ ID 
5 NO:46), GHPO 937 (SEQ ID NO:48), GHPO 1027 (SEQ ID NO:50), GHPO 1099 (SEQ 
ID NO:52), GHPO 1151 (SEQ ID NO:54), GHPO 1275 (SEQ ID NO:56), GHPO 1365 
(SEQ ID NO:58), GHPO 1578 (SEQ ID NO:60), GHPO 22 (SEQ ID NO:62), GHPO 58 
(SEQ ID NO:64), GHPO 200 (SEQ ID NO:66), GHPO 558 (SEQ ID NO:68), GHPO 563 
(SEQ ID NO:70), GHPO 695 (SEQ ID NO:72), GHPO 699 (SEQ ID NO:74), GHPO 702 

1 0 (SEQ ID NO:76), GHPO 709 (SEQ ID NO:78), GHPO 741 (SEQ ID NO:80), GHPO 762 
(SEQ ID NO:82), GHPO 827 (SEQ ID NO:84), GHPO 852 (SEQ ID NO:86), GHPO 
1013 (SEQ ID NO:88), GHPO 1020 (SEQ ID NO:90), GHPO 1031 (SEQ ID NO:92), 
GHPO 1052 (SEQ ID NO:94), GHPO 1 127 (SEQ ID NO:96), GHPO 1 149 (SEQ ID 
NO:98), GHPO 1176 (SEQ ID NO: 100), GHPO 1250 (SEQ ID NO: 102), GHPO 1312 

15 (SEQ ID NO:104), GHPO 1358 (SEQ ID NO:106), GHPO 1490 (SEQ ID NO:108), 

GHPO 1559 (SEQ ID NO:l 10), GHPO 1651 (SEQ ID NO:l 12), GHPO 1726 (SEQ ID 
NO: 1 14), GHPO 1780 (SEQ ID NO: 1 16), GHPO 895 (SEQ ID NO: 118), GHPO 1447 
(SEQ ID NO: 120), GHPO 28 (SEQ ID NO: 122), GHPO 86 (SEQ ID NO: 124), GHPO 
155 (SEQ ID NO: 126), GHPO 157 (SEQ ID NO:128), GHPO 237 (SEQ ID NO: 130), 

2 0 GHPO 290 (SEQ ID NO: 132), GHPO 293 (SEQ ID NO: 134), GHPO 335 (SEQ ID 
NO: 136), GHPO 374 (SEQ ID NO: 138), GHPO 442 (SEQ ID NO: 140), GHPO 480 
(SEQ ID NO: 142), GHPO 523 (SEQ ID NO: 144), GHPO 610 (SEQ ID NO: 146), GHPO 
675 (SEQ ID NO: 148), GHPO 690 (SEQ ID NO: 150), GHPO 829 (SEQ ID NO: 152), 
GHPO 850 (SEQ ID NO: 154), GHPO 876 (SEQ ID NO: 156), GHPO 984 (SEQ ID 

2 5 NO: 158), GHPO 989 (SEQ ID NO:160), GHPO 1111 (SEQ ID NO: 162), GHPO 1 145 
(SEQ ID NO: 164), GHPO 1256 (SEQ ID NO: 166), GHPO 1264 (SEQ ID NO: 168), 
GHPO 1316 (SEQ ID NO: 170), GHPO 1368 (SEQ ID NO: 172), GHPO 1442 (SEQ ID 
NO:174), GHPO 1506 (SEQ ID NO:176), GHPO 1543 (SEQ ID NO:178), GHPO 1574 




(SEQ ID NO:180), GHPO 1627 (SEQ ID NO:182), GHPO 1657 (SEQ ID NO:184), 
GHPO 1664 (SEQ ID NO:186), GHPO 1694 (SEQ ID NO:188), GHPO 1704 (SEQ ID 
NO: 190), GHPO 1763 (SEQ ID NO: 192), GHPO 616 (SEQ ID NO: 194), GHPO 76 
(SEQ ID NO: 196), GHPO 109 (SEQ ID NO: 198), GHPO 163 (SEQ ID NO:200), GHPO 
5 169 (SEQ ID NO:202), GHPO 208 (SEQ ID NO:204), GHPO 219 (SEQ ID NO:206), 
GHPO 445 (SEQ ID NO:208), GHPO 479 (SEQ ID NO:210), GHPO 525 (SEQ ID 
NO:212), GHPO 535 (SEQ ID NO:214), GHPO 731 (SEQ ID NO:216), GHPO 836 
(SEQ ID NO:218), GHPO 879 (SEQ ID NO:220), GHPO 881 (SEQ ID NO:222), GHPO 
886 (SEQ ID NO:224), GHPO 893 (SEQ ID NO:226), GHPO 894 (SEQ ID NO:228), 

1 0 GHPO 976 (SEQ ID NO:230), GHPO 1011 (SEQ ID NO:232), GHPO 1024 (SEQ ID 
NO:234), GHPO 1084 (SEQ ID NO:236), GHPO 1329 (SEQ ID NO:238), GHPO 1330 
(SEQ ID NO:240), GHPO 1346 (SEQ ID NO:242), GHPO 1360 (SEQ ID NO:244), 
GHPO 1388 (SEQ ID NO:246), GHPO 141 1 (SEQ ID NO:248), GHPO 1419 (SEQ ID 
NO:250), GHPO 1446 (SEQ ID NO:252), GHPO 1469 (SEQ ID NO:254), GHPO 1501 

1 5 (SEQ ID NO:256), GHPO 1505 (SEQ ID NO:258), GHPO 1522 (SEQ ID NO:260), 

GHPO 1525 (SEQ ID NO:262), GHPO 1615 (SEQ ID NO:264), GHPO 1689 (SEQ ID 
NO:266), GHPO 1733 (SEQ ID NO:268), GHPO 18 (SEQ ID NO:270), GHPO 139 
(SEQ ID NO:272), GHPO 142 (SEQ ID NO:274), GHPO 250 (SEQ ID NO:276), GHPO 
257 (SEQ ID NO:278), GHPO 325 (SEQ ID NO:280), GHPO 355 (SEQ ID NO:282), 

2 0 GHPO 357 (SEQ ID NO:284), GHPO 454 (SEQ ID NO:286), GHPO 475 (SEQ ID 
NO:288), GHPO 515 (SEQ ID NO:290), GHPO 527 (SEQ ID NO:292), GHPO 551 
(SEQ ID NO:294), GHPO 602 (SEQ ID NO:296), GHPO 626 (SEQ ID NO:298), GHPO 
646 (SEQ ID NO:300), GHPO 653 (SEQ ID NO:302), GHPO 655 (SEQ ID NO:304), 
GHPO 670 (SEQ ID NO:306), GHPO 739 (SEQ ID NO:308), GHPO 798 (SEQ ID 

2 5 NO:310), GHPO 1 102 (SEQ ID NO:312), GHPO 1 1 14 (SEQ ID NO:314), GHPO 1 152 
(SEQ ID NO:316), GHPO 1272 (SEQ ID NO:318), GHPO 1345 (SEQ ID NO:320), 
GHPO 1377 (SEQ ID NO:322), GHPO 1424 (SEQ ID NO:324), GHPO 1430 (SEQ ID 
NO:326), GHPO 1502 (SEQ ID NO:328), GHPO 1600 (SEQ ID NO:330), GHPO 1714 




(SEQ ID NO:332), GHPO 359 (SEQ ID NO:334), GHPO 678 (SEQ ID NO:336), GHPO 
708 (SEQ ID NO:338), GHPO 759 (SEQ ID NO:340), GHPO 847 (SEQ ID NO:342), 
GHPO 1050 (SEQ ID NO:344), GHPO 1 101 (SEQ ID NO:346), GHPO 1 120 (SEQ ID 
NO:348), GHPO 1138 (SEQ ID NO:350), GHPO 1310 (SEQ ID NO:352), GHPO 1320 
5 (SEQ ID NO:354), GHPO 1375 (SEQ ID NO:356), GHPO 1432 (SEQ ID NO:358), 
GHPO 21 (SEQ ID NO:360), GHPO 282 (SEQ ID NO:362), GHPO 1089 (SEQ ID 
NO:364), GHPO 1141 (SEQ ID NO:366), GHPO 1280 (SEQ ID NO:368), and GHPO 
1608 (SEQ ID NO:370), which can be used, e.g., in methods to prevent, treat, or diagnose 
Helicobacter infection. The polypeptides of the invention include those having the amino 

1 0 acid sequences shown in the sequence listing, as well as mature forms of proteins having 
sequences shown in the sequence listing in their unprocessed forms, and fragments 
thereof. Those skilled in the art will understand that the invention also includes 
polynucleotide molecules that encode mutants and derivatives of these polypeptides, 
which can result from the addition, deletion, or substitution of non-essential amino acids, 

15 as is described further below. 

In addition to the polynucleotide molecules described above, the invention 
includes the corresponding polypeptides (i.e., polypeptides encoded by the polynucleotide 
molecules of the invention, or fragments thereof), and monospecific antibodies that 
specifically bind to these polypeptides. 

2 0 The present invention has many applications and includes expression 

cassettes, vectors, and cells transformed or transfected with the polynucleotides of the 
invention. Accordingly, the present invention provides (i) methods for producing 
polypeptides of the invention in recombinant host systems and related expression 
cassettes, vectors, and transformed or transfected cells; (ii) live vaccine vectors, such as 

2 5 pox virus, Salmonella typhimurium, and Vibrio cholerae vectors, that contain 

polynucleotides of the invention (such vaccine vectors being useful in, e.g., methods for 
preventing or treating Helicobacter infection) in combination with a diluent or carrier, and 
related pharmaceutical compositions and associated therapeutic and/or prophylactic 




methods; (iii) therapeutic and/or prophylactic methods involving administration of 
polynucleotide molecules, either in a naked form or formulated with a delivery vehicle, 
polypeptides or mixtures of polypeptides, or monospecific antibodies of the invention, 
and related pharmaceutical compositions; (iv) methods for detecting the presence of 
5 Helicobacter in biological samples, which can involve the use of polynucleotide 

molecules, monospecific antibodies, or polypeptides of the invention; and (v) methods for 
purifying polypeptides of the invention by antibody-based affinity chromatography. 

Detailed Description 
Open reading frames (ORFs) encoding new polypeptides, designated GHPO 

10 35 (SEQ ID NO:2), GHPO 55 (SEQ ID NO:4), GHPO 78 (SEQ ID NO:6), GHPO 89 

(SEQ ID NO:8), GHPO 129 (SEQ ID NO: 10), GHPO 541 (SEQ ID NO: 12), GHPO 607 
(SEQ ID NO:14), GHPO 635 (SEQ ID NO:16), GHPO 701 (SEQ ID NO:18), GHPO 712 
(SEQ ID NO:20), GHPO 761 (SEQ ID NO:22), GHPO 838 (SEQ ID NO:24), GHPO 
1034 (SEQ ID NO:26), GHPO 1085 (SEQ ID NO:28), GHPO 1213 (SEQ ID NO:30), 

15 GHPO 1255 (SEQ ID NO:32), GHPO 1308 (SEQ ID NO:34), GHPO 1389 (SEQ ID 

NO:36), GHPO 1706 (SEQ ID NO:38), GHPO 234 (SEQ ID NO:40), GHPO 314 (SEQ 
ID NO:42), GHPO 510 (SEQ ID NO:44), GHPO 603 (SEQ ID NO:46), GHPO 937 (SEQ 
ID NO:48), GHPO 1027 (SEQ ID NO:50), GHPO 1099 (SEQ ID NO:52), GHPO 1 151 
(SEQ ID NO:54), GHPO 1275 (SEQ ID NO:56), GHPO 1365 (SEQ ID NO:58), GHPO 

20 1578 (SEQ ID NO:60), GHPO 22 (SEQ ID NO:62), GHPO 58 (SEQ ID NO:64), GHPO 
200 (SEQ ID NO:66), GHPO 558 (SEQ ID NO:68), GHPO 563 (SEQ ID NO:70), GHPO 
695 (SEQ ID NO:72), GHPO 699 (SEQ ID NO:74), GHPO 702 (SEQ ID NO:76), GHPO 
709 (SEQ ID NO:78), GHPO 741 (SEQ ID NO:80), GHPO 762 (SEQ ID NO:82), GHPO 
827 (SEQ ID NO:84), GHPO 852 (SEQ ID NO:86), GHPO 1013 (SEQ ID NO:88), 

25 GHPO 1020 (SEQ ID NO:90), GHPO 1031 (SEQ ID NO:92), GHPO 1052 (SEQ ID 
NO:94), GHPO 1 127 (SEQ ID NO:96), GHPO 1 149 (SEQ ID NO:98), GHPO 1 176 
(SEQ ID NO:100), GHPO 1250 (SEQ ID NO:102), GHPO 1312 (SEQ ID NO:104), 




GHPO 1358 (SEQ ID NO: 106), GHPO 1490 (SEQ ID NO: 108), GHPO 1559 (SEQ ID 
NO:110), GHPO 1651 (SEQ ID NO:112), GHPO 1726 (SEQ ID NO:114), GHPO 1780 
(SEQ ID NO:116), GHPO 895 (SEQ ID NO:118), GHPO 1447 (SEQ ID NO:120), 
GHPO 28 (SEQ ID NO: 122), GHPO 86 (SEQ ID NO: 124), GHPO 155 (SEQ ID 
5 NO: 126), GHPO 157 (SEQ ID NO: 128), GHPO 237 (SEQ ID NO: 130), GHPO 290 

(SEQ ID NO: 132), GHPO 293 (SEQ ID NO: 134), GHPO 335 (SEQ ID NO: 136), GHPO 
374 (SEQ ID NO: 138), GHPO 442 (SEQ ID NO: 140), GHPO 480 (SEQ ID NO: 142), 
GHPO 523 (SEQ ID NO: 144), GHPO 610 (SEQ ID NO: 146), GHPO 675 (SEQ ID 
NO: 148), GHPO 690 (SEQ ID NO: 150), GHPO 829 (SEQ ID NO: 152), GHPO 850 

10 (SEQ ID NO:154), GHPO 876 (SEQ ID NO:156), GHPO 984 (SEQ ID NO:158), GHPO 
989 (SEQ ID NO: 160), GHPO 1111 (SEQ ID NO: 162), GHPO 1145 (SEQ ID NO: 164), 
GHPO 1256 (SEQ ID NO:166), GHPO 1264 (SEQ ID NO:168), GHPO 1316 (SEQ ID 
NO: 170), GHPO 1368 (SEQ ID NO: 172), GHPO 1442 (SEQ ID NO: 174), GHPO 1506 
(SEQ ID NO: 176), GHPO 1543 (SEQ ID NO: 178), GHPO 1574 (SEQ ID NO: 180), 

15 GHPO 1627 (SEQ ID NO:182), GHPO 1657 (SEQ ID NO: 184), GHPO 1664 (SEQ ID 
NO: 186), GHPO 1694 (SEQ ID NO: 188), GHPO 1704 (SEQ ID NO: 190), GHPO 1763 
(SEQ ID NO: 192), GHPO 616 (SEQ ID NO: 194), GHPO 76 (SEQ ID NO: 196), GHPO 
109 (SEQ ID NO: 198), GHPO 163 (SEQ ID NO:200), GHPO 169 (SEQ ID NO:202), 
GHPO 208 (SEQ ID NO:204), GHPO 219 (SEQ ID NO:206), GHPO 445 (SEQ ID 

20 NO:208), GHPO 479 (SEQ ID NO:210), GHPO 525 (SEQ ID NO:212), GHPO 535 

(SEQ ID NO:214), GHPO 731 (SEQ ID NO:216), GHPO 836 (SEQ ID NO:218), GHPO 
879 (SEQ ID NO:220), GHPO 881 (SEQ ID NO:222), GHPO 886 (SEQ ID NO:224), 
GHPO 893 (SEQ ID NO:226), GHPO 894 (SEQ ID NO:228), GHPO 976 (SEQ ID 
NO:230), GHPO 1011 (SEQ ID NO:232), GHPO 1024 (SEQ ID NO:234), GHPO 1084 

2 5 (SEQ ID NO:236), GHPO 1329 (SEQ ID NO:238), GHPO 1330 (SEQ ID NO:240), 

GHPO 1346 (SEQ ID NO:242), GHPO 1360 (SEQ ID NO:244), GHPO 1388 (SEQ ID 
NO:246), GHPO 1411 (SEQ ID NO:248), GHPO 1419 (SEQ ID NO:250), GHPO 1446 
(SEQ ID NO:252), GHPO 1469 (SEQ ID NO:254), GHPO 1501 (SEQ ID NO:256), 

- 7 - 




GHPO 1505 (SEQ ID NO:258), GHPO 1522 (SEQ ID NO:260), GHPO 1525 (SEQ ID 
NO:262), GHPO 1615 (SEQ ID NO:264), GHPO 1689 (SEQ ID NO:266), GHPO 1733 
(SEQ ID NO:268), GHPO 18 (SEQ ID NO:270), GHPO 139 (SEQ ID NO:272), GHPO 
142 (SEQ ID NO:274), GHPO 250 (SEQ ID NO:276), GHPO 257 (SEQ ID NO:278), 
5 GHPO 325 (SEQ ID NO:280), GHPO 355 (SEQ ID NO:282), GHPO 357 (SEQ ID 
NO:284), GHPO 454 (SEQ ID NO:286), GHPO 475 (SEQ ID NO:288), GHPO 515 
(SEQ ID NO:290), GHPO 527 (SEQ ID NO:292), GHPO 551 (SEQ ID NO:294), GHPO 
602 (SEQ ID NO:296), GHPO 626 (SEQ ID NO:298), GHPO 646 (SEQ ID NO:300), 
GHPO 653 (SEQ ID NO:302), GHPO 655 (SEQ ID NO:304), GHPO 670 (SEQ ID ' 

10 NO:306), GHPO 739 (SEQ ID NO:308), GHPO 798 (SEQ ID NO:310), GHPO 1 102 
(SEQ ID NO:312), GHPO 1114 (SEQ ID NO:314), GHPO 1152 (SEQ ID NO:316), 
GHPO 1272 (SEQ ID NO:318), GHPO 1345 (SEQ ID NO:320), GHPO 1377 (SEQ ID 
NO:322), GHPO 1424 (SEQ ID NO:324), GHPO 1430 (SEQ ID NO:326), GHPO 1502 
(SEQ ID NO:328), GHPO 1600 (SEQ ID NO:330), GHPO 1714 (SEQ ID NO:332), 

1 5 GHPO 359 (SEQ ID NO:334), GHPO 678 (SEQ ID NO:336), GHPO 708 (SEQ ID 
NO:338), GHPO 759 (SEQ ID NO:340), GHPO 847 (SEQ ID NO:342), GHPO 1050 
(SEQ ID NO:344), GHPO 1101 (SEQ ID NO:346), GHPO 1120 (SEQ ID NO:348), 
GHPO 1 138 (SEQ ID NO:350), GHPO 1310 (SEQ ID NO:352), GHPO 1320 (SEQ ID 
NO:354), GHPO 1375 (SEQ ID NO:356), GHPO 1432 (SEQ ID NO:358), GHPO 21 

2 0 (SEQ ID NO:360), GHPO 282 (SEQ ID NO:362), GHPO 1089 (SEQ ID NO:364), 

GHPO 1 141 (SEQ ID NO:366), GHPO 1280 (SEQ ID NO:368), and GHPO 1608 (SEQ 
ID NO:370), have been identified in the H. pylori genome. These polypeptides can be 
used, for example, in vaccination methods for preventing or treating Helicobacter 
infection. Some of the new polypeptides are secreted polypeptides that can be produced 

2 5 in their mature forms {i.e., as polypeptides that have been exported through class II or 
class III secretion pathways) or as precursors that include signal peptides, which can be 
removed in the course of excretion/secretion by cleavage at the N-terminal end of the 
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mature form. (The cleavage site is located at the C-terminal end of the signal peptide, 
adjacent to the mature form.) 

According to a first aspect of the invention, there are provided isolated 
polynucleotides that encode the precursor and mature forms of the Helicobacter GHPO 
5 proteins listed above (e.g. , GHPO 35 (SEQ ID NO: 1 ), GHPO 55 (SEQ ID NO:3), GHPO 
78 (SEQ ID NO:5), GHPO 89 (SEQ ID NO:7), GHPO 129 (SEQ ID NO:9), GHPO 541 
(SEQ ID NO:l 1), GHPO 607 (SEQ ID NO: 13), GHPO 635 (SEQ ID NO: 15), GHPO 701 
(SEQ ID NO: 17), GHPO 712 (SEQ ID NO: 19), GHPO 761 (SEQ ID NO:21), GHPO 838 
(SEQ ID NO:23), GHPO 1034 (SEQ ID NO:25), GHPO 1085 (SEQ ID NO:27), GHPO 

10 1213 (SEQ ID NO:29), GHPO 1255 (SEQ ID NO:31), GHPO 1308 (SEQ ID NO:33), 
GHPO 1389 (SEQ ID NO:35), GHPO 1706 (SEQ ID NO:37), GHPO 234 (SEQ ID 
NO:39), GHPO 314 (SEQ ID NO:41), GHPO 510 (SEQ ID NO:43), GHPO 603 (SEQ ID 
NO:45), GHPO 937 (SEQ ID NO:47), GHPO 1027 (SEQ ID NO:49), GHPO 1099 (SEQ 
ID NO:51), GHPO 1151 (SEQ ID NO:53), GHPO 1275 (SEQ ID NO:55), GHPO 1365 

15 (SEQ ID NO:57), GHPO 1578 (SEQ ID NO:59), GHPO 22 (SEQ ID NO:61), GHPO 58 
(SEQ ID NO:63), GHPO 200 (SEQ ID NO:65), GHPO 558 (SEQ ID NO:67), GHPO 563 
(SEQ ID NO:69), GHPO 695 (SEQ ID NO:71), GHPO 699 (SEQ ID NO:73), GHPO 702 
(SEQ ID NO:75), GHPO 709 (SEQ ID NO:77), GHPO 741 (SEQ ID NO:79), GHPO 762 
(SEQ ID NO:81), GHPO 827 (SEQ ID NO:83), GHPO 852 (SEQ ID NO:85), GHPO 

20 1013 (SEQ ID NO:87), GHPO 1020 (SEQ ID NO:89), GHPO 1031 (SEQ ID NO:91), 
GHPO 1052 (SEQ ID NO:93), GHPO 1 127 (SEQ ID NO:95), GHPO 1 149 (SEQ ID 
NO:97), GHPO 1176 (SEQ ID NO:99), GHPO 1250 (SEQ ID NO: 101), GHPO 1312 
(SEQ ID NO: 103), GHPO 1358 (SEQ ID NO: 105), GHPO 1490 (SEQ ID NO: 107), 
GHPO 1559 (SEQ ID NO: 109), GHPO 1651 (SEQ ID NO: 111), GHPO 1726 (SEQ ID 

25 NO:113), GHPO 1780 (SEQ ID NO:115), GHPO 895 (SEQ ID NO:117), GHPO 1447 
(SEQ ID NO: 119), GHPO 28 (SEQ ID NO: 121), GHPO 86 (SEQ ID NO: 123), GHPO 
155 (SEQ ID NO: 125), GHPO 157 (SEQ ID NO: 127), GHPO 237 (SEQ ID NO: 129), 
GHPO 290 (SEQ ID NO: 131), GHPO 293 (SEQ ID NO: 133), GHPO 335 (SEQ ID 

- 9 - 



NO: 135), GHPO 374 (SEQ ID NO:137), GHPO 442 (SEQ ID NO: 139), GHPO 480 
(SEQ ID NO: 141), GHPO 523 (SEQ ID NO: 143), GHPO 610 (SEQ ID NO: 145), GHPO 
675 (SEQ ID NO: 147), GHPO 690 (SEQ ID NO: 149), GHPO 829 (SEQ ID NO: 151), 
GHPO 850 (SEQ ID NO: 153), GHPO 876 (SEQ ID NO: 155), GHPO 984 (SEQ ID 
5 NO: 157), GHPO 989 (SEQ ID NO:159), GHPO 1111 (SEQ ID NO: 161), GHPO 1 145 
(SEQ ID NO: 163), GHPO 1256 (SEQ ID NO: 165), GHPO 1264 (SEQ ID NO: 167), 
GHPO 1316 (SEQ ID NO: 169), GHPO 1368 (SEQ ID NO: 171), GHPO 1442 (SEQ ID 
NO:173), GHPO 1506 (SEQ ID NO: 175), GHPO 1543 (SEQ ID NO: 177), GHPO 1574 
(SEQ ID NO: 179), GHPO 1627 (SEQ ID NO:181), GHPO 1657 (SEQ ID NO: 183), 

1 0 GHPO 1 664 (SEQ ID NO: 1 85), GHPO 1 694 (SEQ ID NO: 1 87), GHPO 1 704 (SEQ ID 
NO: 189), GHPO 1763 (SEQ ID NO: 191), GHPO 616 (SEQ ID NO: 193), GHPO 76 
(SEQ ID NO: 195), GHPO 109 (SEQ ID NO: 197), GHPO 163 (SEQ ID NO: 199), GHPO 
169 (SEQ ID NO:201), GHPO 208 (SEQ ID NO:203), GHPO 219 (SEQ ID NO:205), 
GHPO 445 (SEQ ID NO:207), GHPO 479 (SEQ ID NO:209), GHPO 525 (SEQ ID 

15 NO:21 1), GHPO 535 (SEQ ID NO:213), GHPO 731 (SEQ ID NO:215), GHPO 836 

(SEQ ID NO:217), GHPO 879 (SEQ ID NO:219), GHPO 881 (SEQ ID NO:221), GHPO 
886 (SEQ ID NO:223), GHPO 893 (SEQ ID NO:225), GHPO 894 (SEQ ID NO:227), 
GHPO 976 (SEQ ID NO:229), GHPO 101 1 (SEQ ID NO:231), GHPO 1024 (SEQ ID 
NO:233), GHPO 1084 (SEQ ID NO:235), GHPO 1329 (SEQ ID NO:237), GHPO 1330 

2 0 (SEQ ID NO:239), GHPO 1346 (SEQ ID NO:241), GHPO 1360 (SEQ ID NO:243), 

GHPO 1388 (SEQ ID NO:245), GHPO 1411 (SEQ ID NO:247), GHPO 1419 (SEQ ID 
NO:249), GHPO 1446 (SEQ ID NO:251), GHPO 1469 (SEQ ID NO:253), GHPO 1501 
(SEQ ID NO:255), GHPO 1505 (SEQ ID NO:257), GHPO 1522 (SEQ ID NO:259), 
GHPO 1525 (SEQ ID NO:261), GHPO 1615 (SEQ ID NO:263), GHPO 1689 (SEQ ID 

2 5 NO:265), GHPO 1733 (SEQ ID NO:267), GHPO 18 (SEQ ID NO:269), GHPO 139 

(SEQ ID NO:271), GHPO 142 (SEQ ID NO:273), GHPO 250 (SEQ ID NO:275), GHPO 
257 (SEQ ID NO:277), GHPO 325 (SEQ ID NO:279), GHPO 355 (SEQ ID NO:281), 
GHPO 357 (SEQ ID NO:283), GHPO 454 (SEQ ID NO:285), GHPO 475 (SEQ ID 



• 



NO:287), GHPO 515 (SEQ ID NO:289), GHPO 527 (SEQ ID NO:291), GHPO 551 
(SEQ ID NO:293), GHPO 602 (SEQ ID NO:295), GHPO 626 (SEQ ID NO:297), GHPO 
646 (SEQ ID NO:299), GHPO 653 (SEQ ID NO:301), GHPO 655 (SEQ ID NO:303), 
GHPO 670 (SEQ ID NO:305), GHPO 739 (SEQ ID NO:307), GHPO 798 (SEQ ID 
5 NO:309), GHPO 1102 (SEQ ID NO:3 11), GHPO 1114 (SEQ ID NO:3 13), GHPO 1152 
(SEQ ID NO:315), GHPO 1272 (SEQ ID NO:317), GHPO 1345 (SEQ ID NO:319), 
GHPO 1377 (SEQ ID NO:321), GHPO 1424 (SEQ ID NO:323), GHPO 1430 (SEQ ID 
NO:325), GHPO 1502 (SEQ ID NO:327), GHPO 1600 (SEQ ID NO:329), GHPO 1714 
(SEQ ID NO:331), GHPO 359 (SEQ ID NO:333), GHPO 678 (SEQ ID NO:335), GHPO 

1 0 708 (SEQ ID NO:337), GHPO 759 (SEQ ID NO:339), GHPO 847 (SEQ ID NO:341), 
GHPO 1050 (SEQ ID NO:343), GHPO 1101 (SEQ ID NO:345), GHPO 1120 (SEQ ID 
NO:347), GHPO 1 138 (SEQ ID NO:349), GHPO 1310 (SEQ ID NO:351), GHPO 1320 
(SEQ ID NO:353), GHPO 1375 (SEQ ID NO:355), GHPO 1432 (SEQ ID NO:357), 
GHPO 21 (SEQ ID NO:359), GHPO 282 (SEQ ID NO:361), GHPO 1089 (SEQ ID 

1 5 NO:363), GHPO 1 141 (SEQ ID NO:365), GHPO 1280 (SEQ ID NO:367), and GHPO 
1608 (SEQ ID NO:369)). 

An isolated polynucleotide of the invention encodes (i) a polypeptide having 
an amino acid sequence that is homologous to a Helicobacter amino acid sequence of a 
polypeptide, the Helicobacter amino acid sequence being selected from the group 

2 0 consisting of the amino acid sequences shown in the sequence listing, or (ii) a derivative 
of the polypeptide. 

In addition to the full-length polypeptides encoded by the polynucleotides of 
the invention, as set forth above, polynucleotides included in the invention can also 
encode polypeptides that lack signal sequences, as well as other polypeptide or peptide 
2 5 fragments of the full-length polypeptides. 

The term "isolated polynucleotide" is defined as a polynucleotide that is 
removed from the environment in which it naturally occurs. For example, a naturally- 
occurring DNA molecule present in the genome of a living bacteria or as part of a gene 




bank is not isolated, but the same molecule, separated from the remaining part of the 
bacterial genome, as a result of, e.g., a cloning event (amplification), is "isolated." 
Typically, an isolated DNA molecule is free from DNA regions (e.g., coding regions) 
with which it is immediately contiguous, at the 5* or 3* ends, in the naturally occurring 
5 genome. Such isolated polynucleotides can be part of a vector or a composition and still 
be isolated, as such a vector or composition is not part of its natural environment. 

A polynucleotide of the invention can consist of RNA or DNA (e.g., cDNA, 
genomic DNA, or synthetic DNA), or modifications or combinations of RNA or DNA. 
The polynucleotide can be double-stranded or single-stranded and, if single-stranded, can 

10 be the coding (sense) strand or the non-coding (anti-sense) strand. The sequences that 
encode polypeptides of the invention, as shown in the sequence listing, can be (a) the 
coding sequence as shown in any of the nucleotide sequences of the sequence listing (b) a 
ribonucleotide sequence derived by transcription of (a); or (c) a different coding sequence 
that, as a result of the redundancy or degeneracy of the genetic code, encodes the same 

1 5 polypeptides as the polynucleotide molecules having the sequences illustrated in any of 
the nucleotide sequences of the sequence listing. The polypeptide can be one that is 
naturally secreted or excreted by, e.g., H.felis, H. mustelae, H. heilmanii, or H. pylori. 

By "polypeptide" or "protein" is meant any chain of amino acids, regardless of 
length or post-translational modification (e.g., glycosylation or phosphorylation). Both 

2 0 terms are used interchangeably in the present application. 

By "homologous amino acid sequence" is meant an amino acid sequence that 
differs from an amino acid sequence shown in the sequence listing, or an amino acid 
sequence encoded by a nucleotide sequence shown in the sequence listing, by one or more 
non-conservative amino acid substitutions, deletions, or additions located at positions at 

2 5 which they do not destroy the specific antigenicity of the polypeptide. Preferably, such a 
sequence is at least 75%, more preferably at least 80%, and most preferably at least 90% 
identical to an amino acid sequence shown in the sequence listing. 
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Homologous amino acid sequences include sequences that are identical or 
substantially identical to an amino acid sequence as shown in the sequence listing. By 
"amino acid sequence that is substantially identical" is meant a sequence that is at least 
90%, preferably at least 95%, more preferably at least 97%, and most preferably at least 
5 99% identical to an amino acid sequence of reference and that differs from the sequence 
of reference, if at all, by a majority of conservative amino acid substitutions. 

Conservative amino acid substitutions typically include substitutions among 
amino acids of the same class. These classes include, for example, amino acids having 
uncharged polar side chains, such as asparagine, glutamine, serine, threonine, and 
10 tyrosine; amino acids having basic side chains, such as lysine, arginine, and histidine; 

amino acids having acidic side chains, such as aspartic acid and glutamic acid; and amino 
acids having nonpolar side chains, such as glycine, alanine, valine, leucine, isoleucine, 
proline, phenylalanine, methionine, tryptophan, and cysteine. 

Homology can be measured using sequence analysis software (e.g., Sequence 
1 5 Analysis Software Package of the Genetics Computer Group, University of Wisconsin 
Biotechnology Center, 1710 University Avenue, Madison, WI 53705). Similar amino 
acid sequences are aligned to obtain the maximum degree of homology (i.e., identity). To 
this end, it may be necessary to artificially introduce gaps into the sequence. Once the 
optimal alignment has been set up, the degree of homology (i.e., identity) is established 
2 0 by recording all of the positions in which the amino acids of both sequences are identical, 
relative to the total number of positions. 

Homologous polynucleotide sequences are defined in a similar way. 
Preferably, a homologous sequence is one that is at least 45%, more preferably at least 
60%, and most preferably at least 85% identical to a coding sequence of any of the 
2 5 nucleotide sequences set forth in the sequence listing. 

Polypeptides having a sequence homologous to any one of the sequences 
shown in the sequence listing, include naturally-occurring allelic variants, as well as 
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mutants or any other non-naturally occurring variants that are analogous in terms of 
antigenicity, to a polypeptide having a sequence as shown in the sequence listing. 

As is known in the art, an allelic variant is an alternate form of a polypeptide 
that is characterized as having a substitution, deletion, or addition of one or more amino 
5 acids that does not alter the biological function of the polypeptide. By "biological 

function" is meant a function of the polypeptide in the cells in which it naturally occurs, 
even if the function is not necessary for the growth or survival of the cells. For example, 
the biological function of a porin is to allow the entry into cells of compounds present in 
the extracellular medium. The biological function is distinct from the antigenic function. 

10 A polypeptide can have more than one biological function. 

Allelic variants are very common in nature. For example, a bacterial species, 
e.g., H. pylori, is usually represented by a variety of strains that differ from each other by 
minor allelic variations. Indeed, a polypeptide that fulfills the same biological function in 
different strains can have an amino acid sequence that is not identical in each of the 

15 strains. Such an allelic variation can be equally reflected at the polynucleotide level. 

Support for the use of allelic variants of polypeptide antigens comes from, 
e.g., studies of the Helicobacter urease antigen. The amino acid sequence of Helicobacter 
urease varies widely from species to species, yet cross-species protection occurs, 
indicating that the urease molecule, when used as an immunogen, is highly tolerant of 

2 0 amino acid variations. Even among different strains of the single species H. pylori, there 
are amino acid sequence variations. 

For example, although the amino acid sequences of the UreA and UreB 
subunits of H. pylori and H.felis ureases differ from one another by 26.5% and 1 1.8%, 
respectively (Ferrero et al., Molecular Microbiology 9(2):323-333, 1993), it has been 

2 5 shown that H. pylori urease protects mice from H. felis infection (Michetti et ah, 

Gastroenterology 107:1002, 1994). In addition, it has been shown that the individual 
structural subunits of urease, UreA and UreB, which contain distinct amino acid 
sequences, are both protective antigens against Helicobacter infection (Michetti et al., 
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supra). Similarly, Cuenca et al. (Gastroenterology 110:1770, 1996) showed that 
therapeutic immunization of H. mustelae-infected ferrets with H. pylori urease was 
effective at eradicating H. mustelae infection. Further, several urease variants have been 
reported to be effective vaccine antigens, including, e.g., recombinant UreA + UreB 
5 apoenzyme expressed from pORV142 (UreA and UreB sequences derived from H. pylori 
strain CPM630; Lee et al, J. Infect. Dis.l72:161, 1995); recombinant UreA + UreB 
apoenzyme expressed from pORV214 (UreA and UreB sequences differ from H. pylori 
strain CPM630 by one and two amino acid changes, respectively; Lee et al, supra, 1995); 
a UreA-glutathione-S-transferase fusion protein (UreA sequence from H. pylori strain 

10 ATCC 43504; Thomas et al, Acta Gastro-Enterologica Belgica 56:54, 1993); UreA + 
UreB holoenzyme purified from H. pylori strain NCTC1 1637 (Marchetti et al, Science 
267:1655, 1995); a UreA-MBP fusion protein (UreA from H. pylori strain 85P; Ferrero et 
al, Infection and Immunity 62:4981, 1994); a UreB-MBP fusion protein (UreB from 
H. pylori strain 85P; Ferrero et al, supra); a UreA-MBP fusion protein (UreA from H. 

15 felis strain ATCC 49179; Ferrero et al, supra); a UreB-MBP fusion protein (UreB from 
H. felis strain ATCC 49179; Ferrero et al, supra); and a 37 kDa fragment of UreB 
containing amino acids 220-569 (Dore-Davin et al, "A 37 kD fragment of UreB is 
sufficient to confer protection against Helicobacter felis infection in mice"). Finally, 
Thomas et al {supra) showed that oral immunization of mice with crude sonicates of 

20 H. pylori protected mice from subsequent challenge with H. felis. 

Polynucleotides, e.g., DNA molecules, encoding allelic variants can easily be 
obtained by polymerase chain reaction (PCR) amplification of genomic bacterial DNA 
extracted by conventional methods. This involves the use of synthetic oligonucleotide 
primers matching sequences that are upstream and downstream of the 5' and 3' ends of the 

2 5 coding region. Suitable primers can be designed based on the nucleotide sequence 

information provided in the sequence listing. Typically, a primer consists of 10 to 40, 
preferably 15 to 25 nucleotides. It can also be advantageous to select primers containing 
C and G nucleotides in proportions sufficient to ensure efficient hybridization, e.g., an 




amount of C and G nucleotides of at least 40%, preferably 50%, of the total nucleotide 
amount. Those skilled in the art can readily design primers that can be used to isolate the 
polynucleotides of the invention from different Helicobacter strains. Experimental 
conditions for carrying out PCR can readily be determined by one skilled in the art and an 
5 illustration of carrying out PCR is provided in Example 2. As is well known in the art, 
restriction endonuclease recognition sites that contain, typically, 4 to 6 nucleotides (for 
example, the sequences 5'-GGATCC-3' (BamHl) or 5-CTCGAG-3' (Xho\)\ can be 
included on the 5* ends of the primers. Restriction sites can be selected by those skilled in 
the art so that the amplified DNA can be conveniently cloned into an appropriately 

1 0 digested vector, such as a plasmid. 

Useful homologs that do not occur naturally can be designed using known 
methods for identifying regions of an antigen that are likely to be tolerant of amino acid 
sequence changes and/or deletions. For example, sequences of the antigen from different 
species can be compared to identify conserved sequences. 

15 Polypeptide derivatives that are encoded by polynucleotides of the invention 

include, e.g., fragments, polypeptides having large internal deletions derived from full- 
length polypeptides, and fusion proteins. Polypeptide fragments of the invention can be 
derived from a polypeptide having a sequence homologous to any of the sequences of the 
sequence listing, to the extent that the fragments retain the substantial antigenicity of the 

20 parent polypeptide (specific antigenicity). Polypeptide derivatives can also be 

constructed by large internal deletions that remove a substantial part of the parent 
polypeptide, while retaining specific antigenicity. Generally, polypeptide derivatives 
should be about at least 12 amino acids in length to maintain antigenicity. 
Advantageously, they can be at least 20 amino acids, preferably at least 50 amino acids, 

2 5 more preferably at least 75 amino acids, and most preferably at least 100 amino acids in 
length. 

Useful polypeptide derivatives, e.g., polypeptide fragments, can be designed 
using computer-assisted analysis of amino acid sequences in order to identify sites in 




protein antigens having potential as surface-exposed, antigenic regions (Hughes et al., 
Infect. Immun. 60(9):3497, 1992). For example, the Laser Gene Program from DNA Star 
can be used to obtain hydrophilicity, antigenic index, and intensity index plots for the 
polypeptides of the invention. This program can also be used to obtain information about 
5 homologies of the polypeptides with known protein motifs. One skilled in the art can 
readily use the information provided in such plots to select peptide fragments for use as 
vaccine antigens. For example, fragments spanning regions of the plots in which the 
antigenic index is relatively high can be selected. One can also select fragments spanning 
regions in which both the antigenic index and the intensity plots are relatively high. 

1 0 Fragments containing conserved sequences, particularly hydrophilic conserved sequences, 
can also be selected. 

Polypeptide fragments and polypeptides having large internal deletions can be 
used for revealing epitopes that are otherwise masked in the parent polypeptide and that 
may be of importance for inducing a protective T cell-dependent immune response. 

15 Deletions can also remove immunodominant regions of high variability among strains. 

It is an accepted practice in the field of immunology to use fragments and 
variants of protein immunogens as vaccines, as all that is required to induce an immune 
response to a protein is a small (e.g., 8 to 10 amino acids) immunogenic region of the 
protein. This has been done for a number of vaccines against pathogens other than 

2 0 Helicobacter. For example, short synthetic peptides corresponding to surface-exposed 
antigens of pathogens such as murine mammary tumor virus (peptide containing 1 1 
amino acids; Dion et ah, Virology 179:474-477, 1990), Semliki Forest virus (peptide 
containing 16 amino acids; Snijders et al., J. Gen. Virol. 72:557-565, 1991), and canine 
parvovirus (2 overlapping peptides, each containing 15 amino acids; Langeveld et ah, 

2 5 Vaccine 12(15): 1473-1480, 1994) have been shown to be effective vaccine antigens 
against their respective pathogens. 

Polynucleotides encoding polypeptide fragments and polypeptides having 
large internal deletions can be constructed using standard methods (see, e.g., Ausubel et 
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aL, Current Protocols in Molecular Biology, John Wiley & Sons Inc., 1994), for 
example, by PCR, including inverse PCR, by restriction enzyme treatment of the cloned 
DNA molecules, or by the method of Kunkel et aL (Proc. Natl. Acad. Sci. USA 82:448, 
1985; biological material available at Stratagene). 
5 A polypeptide derivative can also be produced as a fusion polypeptide that 

contains a polypeptide or a polypeptide derivative of the invention fused, e.g., at the N- 
or C-terminal end, to any other polypeptide (hereinafter referred to as a peptide tail). 
Such a product can be easily obtained by translation of a genetic fusion, i.e., a hybrid 
gene. Vectors for expressing fusion polypeptides are commercially available, and include 
1 0 the pMal-c2 or pMal-p2 systems of New England Biolabs, in which the peptide tail is a 
maltose binding protein, the glutathione-S-transferase system of Pharmacia, or the His- 
Tag system available from Novagen. These and other expression systems provide 
convenient means for further purification of polypeptides and derivatives of the 
invention. 

1 5 Another particular example of fusion polypeptides included in invention 

includes a polypeptide or polypeptide derivative of the invention fused to a polypeptide 
having adjuvant activity, such as, e.g., subunit B of either cholera toxin or E. coli heat- 
labile toxin. Several possibilities can be used for producing such fusion proteins. First, 
the polypeptide of the invention can be fused to the N-terminal end or, preferably, to the 

2 0 C-terminal end of the polypeptide having adjuvant activity. Second, a polypeptide 

fragment of the invention can be fused within the amino acid sequence of the polypeptide 
having adjuvant activity. Spacer sequences can also be included, if desired. 

As stated above, the polynucleotides of the invention encode Helicobacter 
polypeptides in precursor or mature form. They can also encode hybrid precursors 

2 5 containing heterologous signal peptides, which can mature into polypeptides of the 

invention. By "heterologous signal peptide" is meant a signal peptide that is not found in 
the naturally-occurring precursor of a polypeptide of the invention. 
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A polynucleotide of the invention hybridizes, preferably under stringent 
conditions, to a polynucleotide having a sequence as shown in the sequence listing. 
Hybridization procedures are, e.g., described by Ausubel et al. (supra); Silhavy et al. 
(Experiments with Gene Fusions, Cold Spring Harbor Laboratory Press, Cold Spring 
5 Harbor, New York, 1984); and Davis et al. (A Manual for Genetic Engineering; 

Advanced Bacterial Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
New York, 1980). Important parameters that can be considered for optimizing 
hybridization conditions are reflected in the following formula, which facilitates 
calculation of the melting temperature (Tm), which is the temperature above which two 

10 complementary DNA strands separate from one another (Casey et aL, Nucl. Acid Res. 

4:1539, 1977): Tm = 81.5 + 0.5 x (% G+C) + 1.6 log (positive ion concentration) - 0.6 x 
(% formamide). Under appropriate stringency conditions, hybridization temperature (Th) 
is approximately 20 to 40 °C, 20 to 25 °C, or, preferably, 30 to 40 °C below the calculated 
Tm. Those skilled in the art will understand that optimal temperature and salt conditions 

15 can be readily determined empirically in preliminary experiments using conventional 

procedures. For example, stringent conditions can be achieved, both for pre-hybridizing 
and hybridizing incubations, (i) within 4-16 hours at 42°C, in 6 x SSC containing 
50% formamide or (ii) within 4-16 hours at 65 °C in an aqueous 6 x SSC solution (1 M 
NaCl, 0.1 M sodium citrate (pH 7.0)). For polynucleotides containing 30 to 

2 0 600 nucleotides, the above formula is used and then is corrected by subtracting 

(600/polynucleotide size in base pairs). Stringency conditions are defined by a Th that is 
5 to 10°C below Tm. 

Hybridization conditions with oligonucleotides shorter than 20-30 bases do 
not precisely follow the rules set forth above. In such cases, the formula for calculating 

2 5 the Tm is as follows: Tm = 4 x (G+C) + 2 (A+T). For example, an 18 nucleotide 
fragment of 50% G+C would have an approximate Tm of 54 °C. 

A polynucleotide molecule of the invention, containing RNA, DNA, or 
modifications or combinations thereof, can have various applications. For example, a 
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polynucleotide molecule can be used (i) in a process for producing the encoded 
polypeptide in a recombinant host system, (ii) in the construction of vaccine vectors such 
as poxviruses, which are further used in methods and compositions for preventing and/or 
treating Helicobacter infection, (iii) as a vaccine agent, in a naked form or formulated 
5 with a delivery vehicle and, (iv) in the construction of attenuated Helicobacter strains that 
can over-express a polynucleotide of the invention or express it in a non-toxic, mutated 
form. 

According to a second aspect of the invention, there is therefore provided (i) 
an expression cassette containing a polynucleotide molecule of the invention placed under 

10 the control of elements (e.g., a promoter) required for expression; (ii) an expression 

vector containing an expression cassette of the invention; (iii) a procaryotic or eucaryotic 
cell transformed or transfected with an expression cassette and/or vector of the invention, 
as well as (iv) a process for producing a polypeptide or polypeptide derivative encoded by 
a polynucleotide of the invention, which involves culturing a procaryotic or eucaryotic 

1 5 cell transformed or transfected with an expression cassette and/or vector of the invention, 
under conditions that allow expression of the polynucleotide molecule of the invention 
and, recovering the encoded polypeptide or polypeptide derivative from the cell culture. 

A recombinant expression system can be selected from procaryotic and 
eucaryotic hosts. Eucaryotic hosts include, for example, yeast cells (e.g., Saccharomyces 

2 0 cerevisiae or Pichia Pastoris), mammalian cells (e.g., COS1, NIH3T3, or JEG3 cells), 
arthropods cells (e.g., Spodoptera frugiperda (SF9) cells), and plant cells. Preferably, a 
procaryotic host such as E. coli is used. Bacterial and eucaryotic cells are available from 
a number of different sources that are known to those skilled in the art, e.g., the American 
Type Culture Collection (ATCC; Rockville, Maryland). 

2 5 The choice of the expression cassette will depend on the host system selected, 

as well as the features desired for the expressed polypeptide. For example, it may be 
useful to produce a polypeptide of the invention in a particular lipidated form or any other 
form. Typically, an expression cassette includes a constitutive or inducible promoter that 




is functional in the selected host system; a ribosome binding site; a start codon (ATG); if 
necessary, a region encoding a signal peptide, e.g., a lipidation signal peptide; a 
polynucleotide molecule of the invention; a stop codon; and, optionally, a 3' terminal 
region (translation and/or transcription terminator). The signal peptide-encoding region is 
5 adjacent to the polynucleotide of the invention and is placed in the proper reading frame. 
The signal peptide-encoding region can be homologous or heterologous to the 
polynucleotide molecule encoding the mature polypeptide and it can be specific to the 
secretion apparatus of the host used for expression. The open reading frame constituted 
by the polynucleotide molecule of the invention, alone or together with the signal peptide, 

10 is placed under the control of the promoter so that transcription and translation occur in 
the host system. Promoters and signal peptide-encoding regions are widely known and 
available to those skilled in the art and include, for example, the promoter of Salmonella 
typhimurium (and derivatives) that is inducible by arabinose (promoter araB) and is 
functional in Gram-negative bacteria such as E. coli (U.S. Patent No. 5,028,530; Cagnon 

15 et aL, Protein Engineering 4(7):843, 1991); the promoter of the bacteriophage T7 RNA 
polymerase gene, which is functional in a number of E. coli strains expressing T7 
polymerase (U.S. Patent No. 4,952,496); the OspA lipidation signal peptide; and RlpB 
lipidation signal peptide (Takase et aL, J. Bact. 169:5692, 1987). 

The expression cassette is typically part of an expression vector, which is 

2 0 selected for its ability to replicate in the chosen expression system. Expression vectors 
(e.g., plasmids or viral vectors) can be chosen from, for example, those described in 
Pouwels et ah (Cloning Vectors: A Laboratory Manual, 1985, Supp. 1987) and can 
purchased from various commercial sources. Methods for transforming or transfecting 
host cells with expression vectors are well known in the art and will depend on the host 

2 5 system selected, as described in Ausubel et aL (supra). 

Upon expression, a recombinant polypeptide of the invention (or a polypeptide 
derivative) is produced and remains in the intracellular compartment, is secreted/excreted 
in the extracellular medium or in the periplasmic space, or is embedded in the cellular 




membrane. The polypeptide can then be recovered in a substantially purified form from 
the cell extract or from the supernatant after centrifugation of the cell culture. Typically, 
the recombinant polypeptide can be purified by antibody-based affinity purification or by 
any other method known to a person skilled in the art, such as by genetic fusion to a small 
5 affinity-binding domain. Antibody-based affinity purification methods are also available 
for purifying a polypeptide of the invention extracted from a Helicobacter strain. 
Antibodies useful for immunoaffinity purification of the polypeptides of the invention can 
be obtained using methods described below. 

Polynucleotides of the invention can also be used in DNA vaccination 
1 0 methods, using either a viral or bacterial host as gene delivery vehicle (live vaccine 

vector) or administering the gene in a free form, e.g., inserted into a plasmid. Therapeutic 
or prophylactic efficacy of a polynucleotide of the invention can be evaluated as is 
described below. 

Accordingly, in a third aspect of the invention, there is provided (i) a vaccine 
1 5 vector such as a poxvirus, containing a polynucleotide molecule of the invention placed 
under the control of elements required for expression; (ii) a composition of matter 
containing a vaccine vector of the invention, together with a diluent or carrier; (iii) a 
pharmaceutical composition containing a therapeutically or prophylactically effective 
amount of a vaccine vector of the invention; (iv) a method for inducing an immune 
2 0 response against Helicobacter in a mammal (e.g., a human; alternatively, the method can 
be used in veterinary applications for treating or preventing Helicobacter infection of 
animals, e.g., cats or birds), which involves administering to the mammal an 
immunogenically effective amount of a vaccine vector of the invention to elicit an 
immune response, e.g., a protective or therapeutic immune response to Helicobacter; and 
2 5 (v) a method for preventing and/or treating a Helicobacter (e.g., H. pylori, //. felis, 

H. mustelae, or H. heilmanii) infection, which involves administering a prophylactic or 
therapeutic amount of a vaccine vector of the invention to an individual in need. 
Additionally, the third aspect of the invention encompasses the use of a vaccine vector of 
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the invention in the preparation of a medicament for preventing and/or treating 
Helicobacter infection. 

A vaccine vector of the invention can express one or several polypeptides or 
derivatives of the invention, as well as at least one additional Helicobacter antigen such 
5 as a urease apoenzyme or a subunit, fragment, homolog, mutant, or derivative thereof. In 
addition, it can express a cytokine, such as interleukin-2 (IL-2) or interleukin-12 (IL-12), 
that enhances the immune response. Thus, a vaccine vector can include an additional 
polynucleotide molecules encoding, e.g., urease subunit A, B, or both, or a cytokine, 
placed under the control of elements required for expression in a mammalian cell. 

1 0 Alternatively, a composition of the invention can include several vaccine 

vectors, each of which being capable of expressing a polypeptide or derivative of the 
invention. A composition can also contain a vaccine vector capable of expressing an 
additional Helicobacter antigen such as urease apoenzyme, a subunit, fragment, homolog, 
mutant, or derivative thereof, or a cytokine such as IL-2 or IL-12. 

15 In vaccination methods for treating or preventing infection in a mammal, a 

vaccine vector of the invention can be administered by any conventional route in use in 
the vaccine field, for example, to a mucosal (e.g., ocular, intranasal, oral, gastric, 
pulmonary, intestinal, rectal, vaginal, or urinary tract) surface or via a parenteral (e.g., 
subcutaneous, intradermal, intramuscular, intravenous, or intraperitoneal) route. 

2 0 Preferred routes depend upon the choice of the vaccine vector. The administration can be 
achieved in a single dose or repeated at intervals. The appropriate dosage depends on 
various parameters that are understood by those skilled in the art, such as the nature of the 
vaccine vector itself, the route of administration, and the condition of the mammal to be 
vaccinated (e.g., the weight, age, and general health of the mammal). 

2 5 Live vaccine vectors that can be used in the invention include viral vectors, 

such as adenoviruses and poxviruses, as well as bacterial vectors, e.g., Shigella, 
Salmonella, Vibrio cholerae, Lactobacillus, Bacille bilie de Calmette-Guerin (BCG), and 
Streptococcus. An example of an adenovirus vector, as well as a method for constructing 
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an adenovirus vector capable of expressing a polynucleotide molecule of the invention, is 
described in U.S. Patent No. 4,920,209. Poxvirus vectors that can be used in the 
invention include, e.g., vaccinia and canary pox viruses, which are described in U.S. 
Patent No. 4,722,848 and U.S. Patent No. 5,364,773, respectively (also see, e.g., Tartaglia 
5 et al., Virology 188:217, 1992, for a description of a vaccinia virus vector, and Taylor et 
al, Vaccine 13:539, 1995, for a description of a canary poxvirus vector). Poxvirus 
vectors capable of expressing a polynucleotide of the invention can be obtained by 
homologous recombination, as described in Kieny et al. (Nature 312:163, 1984) so that 
the polynucleotide of the invention is inserted in the viral genome under appropriate 

10 conditions for expression in mammalian cells. Generally, the dose of viral vector 

vaccine, for therapeutic or prophylactic use, can be from about lxlO 4 to about lxlO 11 , 
advantageously from about lxlO 7 to about lxlO 10 , or, preferably, from about lxlO 7 to 
about lxlO 9 plaque-forming units per kilogram. Preferably, viral vectors are administered 
parenterally, for example, in 3 doses that are 4 weeks apart. Those skilled in the art will 

15 recognize that it is preferable to avoid adding a chemical adjuvant to a composition 

containing a viral vector of the invention and thereby minimizing the immune response to 
the viral vector itself. 

Non-toxicogenic Vibrio cholerae mutant strains that can be used in live oral 
vaccines are described by Mekalanos et al. (Nature 306:551, 1983) and in U.S. Patent 

20 No. 4,882,278 (strain in which a substantial amount of the coding sequence of each of the 
two ctxA alleles has been deleted so that no functional cholerae toxin is produced); 
WO 92/1 1354 (strain in which the irgA locus is inactivated by mutation; this mutation 
can be combined in a single strain with ctxA mutations); and WO 94/1533 (deletion 
mutant lacking functional ctxA and attRSl DNA sequences). These strains can be 

2 5 genetically engineered to express heterologous antigens, as described in WO 94/19482. 
An effective vaccine dose of a V. cholerae strain capable of expressing a polypeptide or 
polypeptide derivative encoded by a polynucleotide molecule of the invention can 
contain, e.g., about lxlO 5 to about lxlO 9 , preferably about lxlO 6 to about 1x10 s viable 
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bacteria in an appropriate volume for the selected route of administration. Preferred 
routes of administration include all mucosal routes, but, most preferably, these vectors are 
administered intranasally or orally. 

Attenuated Salmonella typhimurium strains, genetically engineered for 
5 recombinant expression of heterologous antigens, and their use as oral vaccines, are 
described by Nakayama et al (Bio/Technology 6:693, 1988) and in WO 92/1 1361. 
Preferred routes of administration for these vectors include all mucosal routes. Most 
preferably, the vectors are administered intranasally or orally. 

Others bacterial strains useful as vaccine vectors are described by High et al. 

10 (EMBO 11:1991, 1992) and Sizemore et al (Science 270:299, 1995; Shigella flexneri); 
Medaglini et al. (Proc. Natl. Acad. Sci. USA 92:6868, 1995; {Streptococcus gordonii); 
Flynn (Cell. Mol. Biol. 40 (suppl. I):31, 1 194), and in WO 88/6626, WO 90/0594, WO 
91/13157, WO 92/1796, and WO 92/21376 (Bacille Calmette Guerin). In bacterial 
vectors, a polynucleotide of the invention can be inserted into the bacterial genome or it 

1 5 can remain in a free state, for example, carried on a plasmid. 

An adjuvant can also be added to a composition containing a bacterial vector 
vaccine. A number of adjuvants that can be used are known to those skilled in the art. 
For example, preferred adjuvants can be selected from the list provided below. 

According to a fourth aspect of the invention, there is also provided (i) a 

2 0 composition of matter containing a polynucleotide of the invention, together with a 
diluent or carrier; (ii) a pharmaceutical composition containing a therapeutically or 
prophylactically effective amount of a polynucleotide of the invention; (iii) a method for 
inducing an immune response against Helicobacter, in a mammal, by administering to the 
mammal an immunogenically effective amount of a polynucleotide of the invention to 

2 5 elicit an immune response, e.g., a protective immune response to Helicobacter; and (iv) a 
method for preventing and/or treating a Helicobacter {e.g., H. pylori, H. felis, H. 
mustelae, or H. heilmanii) infection, by administering a prophylactic or therapeutic 
amount of a polynucleotide of the invention to an individual in need of such treatment. 




Additionally, the fourth aspect of the invention encompasses the use of a polynucleotide 
of the invention in the preparation of a medicament for preventing and/or treating 
Helicobacter infection. The fourth aspect of the invention preferably includes the use of a 
polynucleotide molecule placed under conditions for expression in a mammalian cell, 
5 e.g., in a plasmid that is unable to replicate in mammalian cells and to substantially 
integrate into a mammalian genome. 

Polynucleotides (for example, DNA or RNA molecules) of the invention can 
also be administered as such to a mammal as a vaccine. When a DNA molecule of the 
invention is used, it can be in the form of a plasmid that is unable to replicate in a 

10 mammalian cell and unable to integrate into the mammalian genome. Typically, a DNA 
molecule is placed under the control of a promoter suitable for expression in a 
mammalian cell. The promoter can function ubiquitously or tissue-specifically. 
Examples of non-tissue specific promoters include the early Cytomegalovirus (CMV) 
promoter (U.S. Patent No. 4,168,062) and the Rous Sarcoma Virus promoter (Norton et 

15 al, Molec. Cell Biol. 5:281, 1985). The desmin promoter (Li et al, Gene 78:243, 1989; 
Li et al, J. Biol. Chem. 266:6562, 1991; Li et al, J. Biol. Chem. 268:10403, 1993) is 
tissue-specific and drives expression in muscle cells. More generally, useful promoters 
and vectors are described, e.g., in WO 94/21797 and by Hartikka et al (Human Gene 
Therapy 7:1205, 1996). 

2 0 For DNA/RNA vaccination, the polynucleotide of the invention can encode a 

precursor or a mature form of a polypeptide of the invention. When it encodes a 
precursor form, the precursor sequence can be homologous or heterologous. In the latter 
case, a eucaryotic leader sequence can be used, such as the leader sequence of the tissue- 
type plasminogen factor (tPA). 

2 5 A composition of the invention can contain one or several polynucleotides of 

the invention. It can also contain at least one additional polynucleotide encoding another 
Helicobacter antigen, such as urease subunit A, B, or both, or a fragment, derivative, 
mutant, or analog thereof. A polynucleotide encoding a cytokine, such as interleukin-2 




(IL-2) or interleukin-12 (IL-12), can also be added to the composition so that the immune 
response is enhanced. These additional polynucleotides are placed under appropriate 
control for expression. Advantageously, DNA molecules of the invention and/or 
additional DNA molecules to be included in the same composition are carried in the same 
5 plasmid. 

Standard methods can be used in the preparation of therapeutic 
polynucleotides of the invention. For example, a polynucleotide can be used in a naked 
form, free of any delivery vehicles, such as anionic liposomes, cationic lipids, 
microparticles, e.g., gold microparticles, precipitating agents, e.g., calcium phosphate, or 

1 0 any other transfection-facilitating agent. In this case, the polynucleotide can be simply 
diluted in a physiologically acceptable solution, such as sterile saline or sterile buffered 
saline, with or without a carrier. When present, the carrier preferably is isotonic, 
hypotonic, or weakly hypertonic, and has a relatively low ionic strength, such as provided 
by a sucrose solution, e.g., a solution containing 20% sucrose. 

15 Alternatively, a polynucleotide can be associated with agents that assist in 

cellular uptake. It can be, e.g., (i) complemented with a chemical agent that modifies 
cellular permeability, such as bupivacaine (see, e.g., WO 94/16737), (ii) encapsulated into 
liposomes, or (iii) associated with cationic lipids or silica, gold, or tungsten 
microparticles. 

2 0 Anionic and neutral liposomes are well-known in the art (see, e.g., Liposomes: 

A Practical Approach, RPC New Ed, IRL Press, 1990, for a detailed description of 
methods for making liposomes) and are useful for delivering a large range of products, 
including polynucleotides. 

Cationic lipids can also be used for gene delivery. Such lipids include, for 

25 example, Lipofectin™, which is also known as DOTMA (N-[l-(2,3-dioleyloxy)propyl]- 
N,N,N-trimethylammonium chloride), DOTAP (l,2-bis(oleyloxy)-3- 
(trimethylammonio)propane), DDAB (dimethyldioctadecylammonium bromide), DOGS 
(dioctadecylamidologlycyl spermine), and cholesterol derivatives. A description of these 




cationic lipids can be found in EP 187,702, WO 90/11092, U.S. Patent No. 5,283,185, 
WO 91/15501, WO 95/26356, and U.S. Patent No. 5,527,928. Cationic lipids for gene 
delivery are preferably used in association with a neutral lipid such as DOPE (dioleyl 
phosphatidylethanolamine; WO 90/1 1092). Other transfection-facilitating compounds 
5 can be added to a formulation containing cationic liposomes. A number of them are 

described in, e.g., WO 93/18759, WO 93/19768, WO 94/25608, and WO 95/2397. They 
include, e.g., spermine derivatives useful for facilitating the transport of DNA through the 
nuclear membrane (see, for example, WO 93/18759) and membrane-permeabilizing 
compounds such as GALA, Gramicidine S, and cationic bile salts (see, for example, 

10 WO 93/19768). 

Gold or tungsten microparticles can also be used for gene delivery, as 
described in WO 91/359, WO 93/17706, and by Tang et al. (Nature 356: 152, 1992). In 
this case, the microparticle-coated polynucleotides can be injected via intradermal or 
intraepidermal routes using a needleless injection device ("gene gun"), such as those 

15 described in U.S. Patent No. 4,945,050, U.S. Patent No. 5,015,580, and WO 94/24263. 

The amount of DNA to be used in a vaccine recipient depends, e.g., on the 
strength of the promoter used in the DNA construct, the immunogenicity of the expressed 
gene product, the condition of the mammal intended for administration {e.g., the weight, 
age, and general health of the mammal), the mode of administration, and the type of 

2 0 formulation. In general, a therapeutically or prophylactically effective dose from about 1 
yug to about 1 mg, preferably, from about 10 jug to about 800 //g, and, more preferably, 
from about 25 jxg to about 250 jxg, can be administered to human adults. The 
administration can be achieved in a single dose or repeated at intervals. 

The route of administration can be any conventional route used in the vaccine 

2 5 field. As general guidance, a polynucleotide of the invention can be administered via a 
mucosal surface, e,g., an ocular, intranasal, pulmonary, oral, intestinal, rectal, vaginal, or 
urinary tract surface, or via a parenteral route, e.g., by an intravenous, subcutaneous, 
intraperitoneal, intradermal, intraepidermal, or intramuscular route. The choice of 




administration route will depend on, e.g., the formulation that is selected. A 
polynucleotide formulated in association with bupivacaine is advantageously 
administered into muscle. When a neutral or anionic liposome or a cationic lipid, such as 
DOTMA, is used, the formulation can be advantageously injected via intravenous, 
5 intranasal (for example, by aerosolization), intramuscular, intradermal, and subcutaneous 
routes. A polynucleotide in a naked form can advantageously be administered via the 
intramuscular, intradermal, or subcutaneous routes. Although not absolutely required, 
such a composition can also contain an adjuvant. A systemic adjuvant that does not 
require concomitant administration in order to exhibit an adjuvant effect is preferable. 

10 The sequence information provided in the present application enables the 

design of specific nucleotide probes and primers that can be used in diagnostic methods. 
Accordingly, in a fifth aspect of the invention, there is provided a nucleotide probe or 
primer having a sequence found in, or derived by degeneracy of the genetic code from, a 
sequence shown in the sequence listing. 

1 5 The term "probe" as used in the present application refers to DNA (preferably 

single stranded) or RNA molecules (or modifications or combinations thereof) that 
hybridize under the stringent conditions, as defined above, to polynucleotide molecules 
having sequences homologous to any of those shown in the sequence listing, or to a 
complementary or anti-sense sequence of any of those shown in the sequence listing. 

2 0 Generally, probes are significantly shorter than the full-length sequences shown in the 
sequence listing. For example, they can contain from about 5 to about 100, preferably 
from about 10 to about 80 nucleotides. In particular, probes have sequences that are at 
least 75%, preferably at least 85%, more preferably 95% homologous to a portion of a 
sequence as shown in the sequence listing, or a sequence complementary to any of such 

2 5 sequences. 

Probes can contain modified bases, such as inosine, methyl-5-deoxycytidine, 
deoxyuridine, dimethylamino-5-deoxyuridine, or diamino-2, 6-purine. Sugar or 
phosphate residues can also be modified or substituted. For example, a deoxyribose 
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residue can be replaced by a polyamide (Nielsen et aL, Science 254:1497, 1991) and 
phosphate residues can be replaced by ester groups such as diphosphate, alkyl, 
arylphosphonate, and phosphorothioate esters. In addition, the 2'-hydroxyl group on 
ribonucleotides can be modified by addition of, e.g., alkyl groups. 
5 Probes of the invention can be used in diagnostic tests, or as capture or 

detection probes. Such capture probes can be immobilized on solid supports, directly or 
indirectly, by covalent means or by passive adsorption. A detection probe can be labeled 
by a detectable label, for example a label selected from radioactive isotopes; enzymes, 
such as peroxidase and alkaline phosphatase; enzymes that are able to hydrolyze a 

1 0 chromogenic, fluorogenic, or luminescent substrate; compounds that are chromogenic, 
fluorogenic, or luminescent; nucleotide base analogs; and biotin. 

Probes of the invention can be used in any conventional hybridization method, 
such as in dot blot methods (Maniatis et aL, Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1982), Southern 

15 blot methods (Southern, J. Mol. Biol. 98:503, 1975), northern blot methods (identical to 
Southern blot to the exception that RNA is used as a target), or a sandwich method (Dunn 
et ah. Cell 12:23, 1977). As is known in the art, the latter technique involves the use of a 
specific capture probe and a specific detection probe that have nucleotide sequences that 
are at least partially different from each other. 

2 0 Primers used in the invention usually contain about 10 to 40 nucleotides and 

are used to initiate enzymatic polymerization of DNA in an amplification process {e.g., 
PCR), an elongation process, or a reverse transcription method. In a diagnostic method 
involving PCR, the primers can be labeled. 

Thus, the invention also encompasses (i) a reagent containing a probe of the 

2 5 invention for detecting and/or identifying the presence of Helicobacter in a biological 

material; (ii) a method for detecting and/or identifying the presence of Helicobacter in a 
biological material, in which (a) a sample is recovered or derived from the biological 
material, (b) DNA or RNA is extracted from the material and denatured, and (c) the 




sample is exposed to a probe of the invention, for example, a capture probe, a detection 
probe, or both, under stringent hybridization conditions, so that hybridization is detected; 
and (iii) a method for detecting and/or identifying the presence of Helicobacter in a 
biological material, in which (a) a sample is recovered or derived from the biological 
5 material, (b) DNA is extracted therefrom, (c) the extracted DNA is contacted with at least 
one, or, preferably two, primers of the invention, and amplified by the polymerase chain 
reaction, and (d) an amplified DNA molecule is produced. 

As mentioned above, polypeptides that can be produced by expression of the 
polynucleotides of the invention can be used as vaccine antigens. Accordingly, a sixth 

1 0 aspect of the invention features a substantially purified polypeptide or polypeptide 

derivative having an amino acid sequence encoded by a polynucleotide of the invention. 

A "substantially purified polypeptide" is defined as a polypeptide that is 
separated from the environment in which it naturally occurs and/or a polypeptide that is 
free of most of the other polypeptides that are present in the environment in which it was 

1 5 synthesized. The polypeptides of the invention can be purified from a natural source, 
such as a Helicobacter strain, or can be produced using recombinant methods. 

Homologous polypeptides or polypeptide derivatives encoded by 
polynucleotides of the invention can be screened for specific antigenicity by testing cross- 
reactivity with an antiserum raised against a polypeptide having an amino acid sequence 

2 0 as shown in the sequence listing. Briefly, a monospecific hyperimmune antiserum can be 
raised against a purified reference polypeptide as such or as a fusion polypeptide, for 
example, an expression product of MBP, GST, or His-tag systems, or a synthetic peptide 
predicted to be antigenic. The homologous polypeptide or derivative that is screened for 
specific antigenicity can be produced as such or as a fusion polypeptide. In the latter 

2 5 case, and if the antiserum is also raised against a fusion polypeptide, two different fusion 
systems are employed. Specific antigenicity can be determined using a number of 
methods, including Western blot (Towbin et al. y Proc. Natl. Acad. Sci. USA 76:4350, 
1979), dot blot, and ELISA methods, as described below. 




In a Western blot assay, the product to be screened, either as a purified 
preparation or a total E. coli extract, is fractionated by SDS-PAGE, as described, for 
example, by Laemmli (Nature 227:680, 1970). After being transferred to a filter, such as 
a nitrocellulose membrane, the material is incubated with the monospecific hyperimmune 
5 antiserum, which is diluted in a range of dilutions from about 1:50 to about 1 :5000, 

preferably from about 1:100 to about 1:500. Specific antigenicity is shown once a band 
corresponding to the product exhibits reactivity at any of the dilutions in the range. 

In an ELISA assay, the product to be screened can be used as the coating 
antigen. A purified preparation is preferred, but a whole cell extract can also be used. 

10 Briefly, about 100 fxL of a preparation of about 10 jug protein/mL is distributed into wells 
of a 96-well ELISA plate. The plate is incubated for about 2 hours at 37 °C, then 
overnight at 4°C. The plate is washed with phosphate buffer saline (PBS) containing 
0.05% Tween 20 (PBS/Tween buffer) and the wells are saturated with 250 juL PBS 
containing 1% bovine serum albumin (BSA), to prevent non-specific antibody binding. 

15 After 1 hour of incubation at 37 °C, the plate is washed with PBS/Tween buffer. The 
antiserum is serially diluted in PBS/Tween buffer containing 0.5% BSA, and 100 //L 
dilutions are added to each well. The plate is incubated for 90 minutes at 37 °C, washed, 
and evaluated using standard methods. For example, a goat anti-rabbit peroxidase 
conjugate can be added to the wells when the specific antibodies used were raised in 

2 0 rabbits. Incubation is carried out for about 90 minutes at 37 °C and the plate is washed. 
The reaction is developed with the appropriate substrate and the reaction is measured by 
colorimetry (absorbance measured spectrophotometrically). Under these experimental 
conditions, a positive reaction is shown once an O.D. value of 1.0 is detected with a 
dilution of at least about 1:50, preferably of at least about 1:500. 

2 5 In a dot blot assay, a purified product is preferred, although a whole cell 

extract can be used. Briefly, a solution of the product at a concentration of about 100 
^ug/mL is serially diluted two-fold with 50 mM Tris-HCl (pH 7.5). One hundred ,uL of 
each dilution is applied to a filter, such as a 0.45 /um nitrocellulose membrane, set in a 96- 
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well dot blot apparatus (Biorad). The buffer is removed by applying vacuum to the 
system. Wells are washed by addition of 50 raM Tris-HCl (pH 7.5) and the membrane is 
air-dried. The membrane is saturated in blocking buffer (50 mM Tris-HCl (pH 7.5), 0.15 
M NaCl, 10 g/L skim milk) and incubated with an antiserum diluted from about 1:50 to 
5 about 1:5000, preferably about 1:500. The reaction is detected using standard methods. 
For example, a goat anti-rabbit peroxidase conjugate can be added to the wells when 
rabbit antibodies are used. Incubation is carried out for about 90 minutes at 37 °C and the 
blot is washed. The reaction is developed with the appropriate substrate and stopped. 
The reaction is then measured visually by the appearance of a colored spot, e.g., by 

1 0 colorimetry . Under these experimental conditions, a positive reaction is associated with 
detection of a colored spot for reactions carried out with a dilution of at least about 1:50, 
preferably, of at least about 1:500. Therapeutic or prophylactic efficacy of a polypeptide 
or polypeptide derivative of the invention can be evaluated as described below. 

According to a seventh aspect of the invention, there is provided (i) a 

1 5 composition of matter containing a polypeptide of the invention together with a diluent or 
carrier; (ii) a pharmaceutical composition containing a therapeutically or prophylactically 
effective amount of a polypeptide of the invention; (iii) a method for inducing an immune 
response against Helicobacter in a mammal by administering to the mammal an 
immunogenically effective amount of a polypeptide of the invention to elicit an immune 

2 0 response, e.g., a protective immune response to Helicobacter; and (iv) a method for 
preventing and/or treating a Helicobacter (e.g., H. pylori, H.felis, H. mustelae, or H. 
heilmanii) infection, by administering a prophylactic or therapeutic amount of a 
polypeptide of the invention to an individual in need of such treatment. Additionally, this 
aspect of the invention includes the use of a polypeptide of the invention in the 

2 5 preparation of a medicament for preventing and/or treating Helicobacter infection. 

The immunogenic compositions of the invention can be administered by any 
conventional route in use in the vaccine field, for example, to a mucosal (e.g., ocular, 
intranasal, pulmonary, oral, gastric, intestinal, rectal, vaginal, or urinary tract) surface or 




via a parenteral (e.g., subcutaneous, intradermal, intramuscular, intravenous, or 
intraperitoneal) route. The choice of the administration route depends upon a number of 
parameters, such as the adjuvant used. For example, if a mucosal adjuvant is used, the 
intranasal or oral route will be preferred, and if a lipid formulation or an aluminum 
5 compound is used, a parenteral route will be preferred. In the latter case, the 

subcutaneous or intramuscular route is most preferred. The choice of administration 
route can also depend upon the nature of the vaccine agent. For example, a polypeptide 
of the invention fused to CTB or to LTB will be best administered to a mucosal surface. 

A composition of the invention can contain one or several polypeptides or 

10 derivatives of the invention. It can also contain at least one additional Helicobacter 
antigen, such as the urease apoenzyme, or a subunit, fragment, homolog, mutant, or 
derivative thereof. 

For use in a composition of the invention, a polypeptide or polypeptide 
derivative can be formulated into or with liposomes, such as neutral or anionic liposomes, 

15 microspheres, ISCOMS, or virus-like particles (VLPs), to facilitate delivery and/or 

enhance the immune response. These compounds are readily available to those skilled in 
the art; for example, see Liposomes: A Practical Approach (supra). Adjuvants other than 
liposomes can also be used in the invention and are well known in the art (see, for 
example, the list provided below). 

2 0 Administration can be achieved in a single dose or repeated as necessary at 

intervals that can be determined by one skilled in the art. For example, a priming dose 
can be followed by three booster doses at weekly or monthly intervals. An appropriate 
dose depends on various parameters, including the nature of the recipient (e.g., whether 
the recipient is an adult or an infant), the particular vaccine antigen, the route and 

2 5 frequency of administration, the presence/absence or type of adjuvant, and the desired 

effect (e.g., protection and/or treatment), and can be readily determined by one skilled in 
the art. In general, a vaccine antigen of the invention can be administered mucosally in 
an amount ranging from about 10 /^g to about 500 mg, preferably from about 1 mg to 




about 200 mg. For a parenteral route of administration, the dose usually should not 
exceed about 1 mg, and is, preferably, about 100 >ug. 

When used as components of a vaccine, the polynucleotides and polypeptides 
of the invention can be used sequentially as part of a multi-step immunization process. 
5 For example, a mammal can be initially primed with a vaccine vector of the invention, 
such as a pox virus, e.g., via a parenteral route, and then boosted twice with a polypeptide 
encoded by the vaccine vector, e.g., via the mucosal route. In another example, 
liposomes associated with a polypeptide or polypeptide derivative of the invention can be 
used for priming, with boosting being carried out mucosally using a soluble polypeptide 
10 or polypeptide derivative of the invention, in combination with a mucosal adjuvant {e.g., 
LT). 

Polypeptides and polypeptide derivatives of the invention can also be used as 
diagnostic reagents for detecting the presence of anti-Helicobacter antibodies, e.g., in 
blood samples. Such polypeptides can be about 5 to about 80, preferably, about 10 to 

1 5 about 50 amino acids in length and can be labeled or unlabeled, depending upon the 

diagnostic method. Diagnostic methods involving such a reagent are described below. 

Upon expression of a polynucleotide molecule of the invention, a polypeptide 
or polypeptide derivative is produced and can be purified using known methods. For 
example, the polypeptide or polypeptide derivative can be produced as a fusion protein 

2 0 containing a fused tail that facilitates purification. The fusion product can be used to 
immunize a small mammal, e.g., a mouse or a rabbit, in order to raise monospecific 
antibodies against the polypeptide or polypeptide derivative. The eighth aspect of the 
invention thus provides a monospecific antibody that binds to a polypeptide or 
polypeptide derivative of the invention. 

2 5 By "monospecific antibody" is meant an antibody that is capable of reacting 

with a unique, naturally-occurring Helicobacter polypeptide. An antibody of the 
invention can be polyclonal or monoclonal. Monospecific antibodies can be recombinant, 
e.g., chimeric (e.g., consisting of a variable region of murine origin and a human constant 
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region), humanized (e.g., a human immunoglobulin constant region and a variable region 
of animal, e.g., murine, origin), and/or single chain. Both polyclonal and monospecific 
antibodies can also be in the form of immunoglobulin fragments, e.g., F(ab)'2 or Fab 
fragments. The antibodies of the invention can be of any isotype, e.g., IgG or IgA, and 
5 polyclonal antibodies can be of a single isotype or can contain a mixture of iso types. 

The antibodies of the invention, which can be raised to a polypeptide or 
polypeptide derivative of the invention, can be produced and identified using standard 
immunological assays, e.g., Western blot assays, dot blot assays, or ELISA (see, e.g., 
Coligan et aL, Current Protocols in Immunology, John Wiley & Sons, Inc., New York, 

10 NY, 1994). The antibodies can be used in diagnostic methods to detect the presence of 
Helicobacter antigens in a sample, such as a biological sample. The antibodies can also 
be used in affinity chromatography methods for purifying a polypeptide or polypeptide 
derivative of the invention. As is discussed further below, the antibodies can also be used 
in prophylactic and therapeutic passive immunization methods. 

1 5 Accordingly, a ninth aspect of the invention provides (i) a reagent for 

detecting the presence of Helicobacter in a biological sample that contains an antibody, 
polypeptide, or polypeptide derivative of the invention; and (ii) a diagnostic method for 
detecting the presence of Helicobacter in a biological sample, by contacting the biological 
sample with an antibody, a polypeptide, or a polypeptide derivative of the invention, so 

2 0 that an immune complex is formed, and detecting the complex as an indication of the 
presence of Helicobacter in the sample or the organism from which the sample was 
derived. The immune complex is formed between a component of the sample and the 
antibody, polypeptide, or polypeptide derivative, and that any unbound material can be 
removed prior to detecting the complex. A polypeptide reagent can be used for detecting 

2 5 the presence of anti-Helicobacter antibodies in a sample, e.g., a blood sample, while an 
antibody of the invention can be used for screening a sample, such as a gastric extract or 
biopsy sample, for the presence of Helicobacter polypeptides. 



- 36 - 



!f« O ;R ;ll \., ,r %«a ^ ir, h 




For use in diagnostic methods, the reagent (e.g., the antibody, polypeptide, or 
polypeptide derivative of the invention) can be in a free state or can be immobilized on a 
solid support, such as, for example, on the interior surface of a tube or on the surface, or 
within pores, of a bead. Immobilization can be achieved using direct or indirect means. 
5 Direct means include passive adsorption (i.e., non-covalent binding) or covalent binding 
between the support and the reagent. By "indirect means" is meant that an anti-reagent 
compound that interacts with the reagent is first attached to the solid support. For 
example, if a polypeptide reagent is used, an antibody that binds to it can serve as an anti- 
reagent, provided that it binds to an epitope that is not involved in recognition of 

10 antibodies in biological samples. Indirect means can also employ a ligand-receptor 

system, for example, a molecule, such as a vitamin, can be grafted onto the polypeptide 
reagent and the corresponding receptor can be immobilized on the solid phase. This 
concept is illustrated by the well known biotin-streptavidin system. Alternatively, 
indirect means can be used, e.g., by adding to the reagent a peptide tail, chemically or by 

1 5 genetic engineering, and immobilizing the grafted or fused product by passive adsorption 
or covalent linkage of the peptide tail. 

According to a tenth aspect of the invention, there is provided a process for 
purifying, from a biological sample, a polypeptide or polypeptide derivative of the 
invention, which involves carrying out antibody-based affinity chromatography with the 

2 0 biological sample, wherein the antibody is a monospecific antibody of the invention. 

For use in a purification process of the invention, the antibody can be 
polyclonal or monospecific, and preferably is of the IgG type. Purified IgGs can be 
prepared from an antiserum using standard methods (see, e.g., Coligan et aL, supra). 
Conventional chromatography supports, as well as standard methods for grafting 

2 5 antibodies, are described, for example, by Harlow et aL (Antibodies: A Laboratory 

Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1988). 

Briefly, a biological sample, such as an H. pylori extract, preferably in a buffer 
solution, is applied to a chromatography material, which is, preferably, equilibrated with 



the buffer used to dilute the biological sample, so that the polypeptide or polypeptide 
derivative of the invention (i.e., the antigen) is allowed to adsorb onto the material. The 
chromatography material, such as a gel or a resin coupled to an antibody of the invention, 
can be in batch form or in a column. The unbound components are washed off and the 
5 antigen is eluted with an appropriate elution buffer, such as a glycine buffer, a buffer 
containing a chaotropic agent, e.g., guanidine HC1, or a buffer having high salt 
concentration (e.g., 3 M MgCl 2 ). Eluted fractions are recovered and the presence of the 
antigen is detected, e.g., by measuring the absorbance at 280 nm. 

An antibody of the invention can be screened for therapeutic efficacy as 

10 follows. According to an eleventh aspect of the invention, there is provided (i) a 

composition of matter containing a monospecific antibody of the invention, together with 
a diluent or carrier; (ii) a pharmaceutical composition containing a therapeutically or 
prophylactically effective amount of a monospecific antibody of the invention, and (iii) a 
method for treating or preventing Helicobacter {e.g., H. pylori, H.felis, H. mustelae, or 

15 H. heilmanii) infection, by administering a therapeutic or prophylactic amount of a 

monospecific antibody of the invention to an individual in need of such treatment. In 
addition, the eleventh aspect of the invention includes the use of a monospecific antibody 
of the invention in the preparation of a medicament for treating or preventing 
Helicobacter infection. 

2 0 The monospecific antibody can be polyclonal or monoclonal, and is, 

preferably, predominantly of the IgA isotype. In passive immunization methods, the 
antibody is administered to a mucosal surface of a mammal, e.g., the gastric mucosa, e.g., 
orally or intragastrically, optionally, in the presence of a bicarbonate buffer. 
Alternatively, systemic administration, not requiring a bicarbonate buffer, can be carried 

2 5 out. A monospecific antibody of the invention can be administered as a single active 
agent or as a mixture with at least one additional monospecific antibody specific for a 
different Helicobacter polypeptide. The amount of antibody and the particular regimen 
used can be readily determined by one skilled in the art. For example, daily 




administration of about 100 to 1,000 mg of antibody over one week, or three doses per 
day of about 100 to 1,000 mg of antibody over two or three days, can be effective 
regimens for most purposes. 

Therapeutic or prophylactic efficacy can be evaluated using standard methods 
5 in the art, e.g., by measuring induction of a mucosal immune response or induction of 
protective and/or therapeutic immunity, using, e.g., the H.felis mouse model and the 
procedures described by Lee et al. (Eur. J. Gastroenterology & Hepatology 7:303, 1995) 
or Lee et al. (J. Infect. Dis. 172:161, 1995). Those skilled in the art will recognize that 
the H. felis strain of the model can be replaced with another Helicobacter strain. For 

10 example, the efficacy of polynucleotide molecules and polypeptides from H. pylori is, 
preferably, evaluated in a mouse model using an H. pylori strain. Protection can be 
determined by comparing the degree of Helicobacter infection in the gastric tissue 
assessed by, for example, urease activity, bacterial counts, or gastritis, to that of a control 
group. Protection is shown when infection is reduced by comparison to the control 

15 group. Such an evaluation can be made for polynucleotides, vaccine vectors, 

polypeptides, and polypeptide derivatives, as well as for antibodies of the invention. 

For example, various doses of an antibody of the invention can be 
administered to the gastric mucosa of mice previously challenged with an H. pylori strain, 
as described, e.g., by Lee et al. (supra). Then, after an appropriate period of time, the 

2 0 bacterial load of the mucosa can be estimated by assessing urease activity, as compared to 
a control. Reduced urease activity indicates that the antibody is therapeutically effective. 

Adjuvants that can be used in any of the vaccine compositions described 
above are described as follows. Adjuvants for parenteral administration include, for 
example, aluminum compounds, such as aluminum hydroxide, aluminum phosphate, and 

2 5 aluminum hydroxy phosphate. The antigen can be precipitated with, or adsorbed onto, 
the aluminum compound using standard methods. Other adjuvants, such as RIBI 
(ImmunoChem, Hamilton, MT), can also be used in parenteral administration. 
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Adjuvants that can be used for mucosal administration include, for example, 
bacterial toxins, e.g., the cholera toxin (CT), the E. coli heat-labile toxin (LT), the 
Clostridium difficile toxin A, the pertussis toxin (PT), and combinations, subunits, 
toxoids, or mutants thereof. For example, a purified preparation of native cholera toxin 
5 subunit B (CTB) can be used. Fragments, homologs, derivatives, and fusions to any of 
these toxins can also be used, provided that they retain adjuvant activity. Preferably, a 
mutant having reduced toxicity is used. Suitable mutants are described, e.g., in WO 
95/17211 (Arg-7-Lys CT mutant), WO 96/6627 (Arg-192-Gly LT mutant), and WO 
95/34323 (Arg-9-Lys and Glu-129-Gly PT mutant). Additional LT mutants that can be 

10 used in the methods and compositions of the invention include, e.g., Ser-63-Lys, Ala-69- 
Gly, Glu-1 10- Asp, and Glu-1 12- Asp mutants. Other adjuvants, such as the bacterial 
monophosphoryl lipid A (MPLA) of, e.g., E. coli, Salmonella minnesota, Salmonella 
typhimurium, or Shigella flexneri; saponins, and polylactide glycolide (PLGA) 
microspheres, can also be used in mucosal administration. Adjuvants useful for both 

15 mucosal and parenteral administrations, such as polyphosphazene (WO 95/2415), can 
also be used. 

Any pharmaceutical composition of the invention, containing a 
polynucleotide, polypeptide, polypeptide derivative, or antibody of the invention, can be 
manufactured using standard methods. It can be formulated with a pharmaceutically 

2 0 acceptable diluent or carrier, e.g., water or a saline solution, such as phosphate buffer 
saline, optionally, including a bicarbonate salt, such as sodium bicarbonate, e.g., 0.1 to 
0.5 M. Bicarbonate can advantageously be added to compositions intended for oral or 
intragastric administration. In general, a diluent or carrier can be selected on the basis of 
the mode and route of administration, and standard pharmaceutical practice. Suitable 

2 5 pharmaceutical carriers and diluents, as well as pharmaceutical necessities for their use in 
pharmaceutical formulations, are described in Remington's Pharmaceutical Sciences, a 
standard reference text in this field and in the USP/NF. 
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The invention also includes methods in which gastroduodenal infections, such 
as Helicobacter infection, are treated by oral administration of a Helicobacter polypeptide 
of the invention and a mucosal adjuvant, in combination with an antibiotic, an 
antisecretory agent, a bismuth salt, an antacid, sucralfate, or a combination thereof. 
5 Examples of such compounds that can be administered with the vaccine antigen and an 
adjuvant are antibiotics, including, e.g., macrolides, tetracyclines, p-lactams, 
aminoglycosides, quinolones, penicillins, and derivatives thereof (specific examples of 
antibiotics that can be used in the invention include, e.g., amoxicillin, clarithromycin, 
tetracycline, metronidizole, erythromycin, cefuroxime, and erythromycin); antisecretory 

10 agents, including, e.g., H 2 -receptor antagonists (e.g., cimetidine, ranitidine, famotidine, 
nizatidine, and roxatidine), proton pump inhibitors (e.g., omeprazole, lansoprazole, and 
pantoprazole), prostaglandin analogs (e.g., misoprostil and enprostil), and anticholinergic 
agents (e.g., pirenzepine, telenzepine, carbenoxolone, and proglumide); and bismuth salts, 
including colloidal bismuth subcitrate, tripotassium dicitrate bismuthate, bismuth 

15 subsalicylate, bicitropeptide, and pepto-bismol (see, e.g., Goodwin et al., 

Helicobacter pylori, Biology and Clinical Practice, CRC Press, Boca Raton, FL, pp 366- 
395, 1993; Physicians' Desk Reference, 49 th edn., Medical Economics Data Production 
Company, Montvale, New Jersey, 1995). In addition, compounds containing more than 
one of the above-listed components coupled together, e.g., ranitidine coupled to bismuth 

2 0 subcitrate, can be used. The invention also includes compositions for carrying out these 
methods, i.e., compositions containing a Helicobacter antigen (or antigens) of the 
invention, an adjuvant, and one or more of the above-listed compounds, in a 
pharmaceutically acceptable carrier or diluent. 

Amounts of the above-listed compounds used in the methods and 

2 5 compositions of the invention can readily be determined by one skilled in the art. In 

addition, one skilled in the art can readily design treatment/immunization schedules. For 
example, the non- vaccine components can be administered on days 1-14, and the vaccine 
antigen + adjuvant can be administered on days 7, 14, 21, and 28. 




Methods and pharmaceutical compositions of the invention can be used to 
treat or to prevent Helicobacter infections and, accordingly, gastroduodenal diseases 
associated with these infections, including acute, chronic, and atrophic gastritis, and 
peptic ulcer diseases, e.g., gastric and duodenal ulcers. 
5 The invention is further illustrated by the following examples. Example 1 

describes identification of genes, such as genes that encode the polypeptides of the 
invention, in the Helicobacter genome, as well as identification of leader sequences, and 
primer design for amplification of genes lacking signal sequences. Example 2 describes 
cloning of DNA molecules encoding polypeptides of the invention into a vector that 
1 0 provides a histidine tag, and production and purification of the resulting his-tagged fusion 
proteins. Example 3 describes methods for cloning DNA encoding the polypeptides of 
the invention so that they can be produced without his-tags, and Example 4 describes 
methods for purifying recombinantly produced polypeptides of the invention. 

EXAMPLE 1: Identification of genes in the H. pylori genome, identification of 
1 5 leader sequences, and primer design for amplification of genes lacking signal 
sequences 

l.A. Creating H. pylori genomic databases 

The H. pylori genome was provided as a text file containing a single 
contiguous string of nucleotides that had been determined to be 1 .76 Megabases in length. 

2 0 The complete genome was split into 17 separate files using the program SPLIT 

(Creativity in Action), giving rise to 16 contigs, each containing 100,000 nucleotides, and 
a 17 th contig containing the remaining 76,000 nucleotides. A header was added to each of 
the 17 files using the format: >hpg0.txt (representing contig 1), .hpgl.txt (representing 
contig 2), etc. The resulting 17 files, named hpgO through hpgl6, were then copied 

2 5 together to form one file that represented the plus strand of the complete H. pylori 

genome. The constructed database was given the designation "H." A negative strand 
database of the H. pylori genome was created similarly by first creating a reverse 
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complement of the positive strand using the program SeqPup (D.G. Gilbert, Indiana 
University Biology Department) and then performing the same procedure as described 
above for the plus strand. This database was given the designation "N." 

The regions predicted to encode open reading frames (ORFs) were defined for 

5 the complete H. pylori genome using the program GENEMARK™ (Borodovsky et al. y 
Comp. Chem. 17:123, 1993). A database was created from a text file containing an 
annotated version of all ORFs predicted to be encoded by the H. pylori genome for both 
the plus and minus strands, and was given the designation "O." Each ORF was assigned 
a number indicating its location on the genome and its position relative to other genes. 

0 No manipulation of the text file was required. 

I.B. Searching the H. pylori databases 

The databases constructed as is described above were searched using the 
program FASTA (Pearson et aL, Proc. Natl. Acad. Sci. USA 85:2444-2448, 1988). 
FASTA was used for searching either a DNA sequence against either of the gene 
5 databases ("H" and/or "N"), or a peptide sequence against the ORF library ("O")- 

TFASTX was used to search a peptide sequence against all possible reading frames of a 
DNA database ("H" and/or "N" libraries). Potential frameshifts also being resolved, 
FASTX was used for searching the translated reading frames of a DNA sequence against 
either a DNA database, or a peptide sequence against the protein database. 

0 l.C. Isolation of DNA sequences from the H. pylori genome 

The FASTA searches against the constructed DNA databases identified exact 
nucleotide coordinates on one or more of the isolated contigs, and therefore the location 
of the target DNA. Once the exact location of the target sequence was known, the contig 
identified to carry the gene was exported into the software package MapDraw (DNAStar, 

5 Inc.) and the gene was isolated. Gene sequences with flanking DNA was then excised 
and copied into the EditSeq. Software package (DNAStar, Inc.) for further analysis. 
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I.D. Identification of leader sequences 

The deduced protein encoded by a target gene sequence is analyzed using the 
PROTEAN software package (DNAStar, Inc.). This analysis predicts those areas of the 
protein that are hydrophobic by using the Kyte-Doolittle algorithm, and identifies any 
5 potential polar residues preceding the hydrophobic core region, which is typical for many 
leader sequences. For confirmation, the target protein is then searched against a 
PROSITE database (DNAStar, Inc.) consisting of motifs and signatures. Characteristic of 
many leader sequences and hydrophobic regions in general, is the identification of 
predicted prokaryotic lipid attachment sites. Where confirmation between the two 
1 0 approaches is apparent at the N-terminus of any protein, putative cleavage sites are 

sought. Specifically, this includes the presence of either an Alanine (A), Serine (S), or 
Glycine (G) residue immediately after the core hydrophobic region. In the case of 
lipoproteins, a Cysteine (C) residue would be identified as the +1 residue, post-cleavage. 

I.E. Rational design of PGR primers based on the identification of leader sequences 

15 In order to clone gene sequences as N-terminus translational fusions for the 

generation of recombinant proteins with N-terminal Histidine tags, the gene sequence that 
specifies the leader sequence is omitted. The 5'-end of the gene-specific portion of the N- 
terminal primer is designed to start at the first codon beyond the cleavage site. In the case 
of lipoproteins, the 5'- end of the N-terminal primer begins at the second codon, 

20 immediately after the modifiable residue at position +1 post-cleavage. The omission of 
the leader sequence from the recombinant allows for one-step purification, and potential 
problems associated with insertion of leader sequences in the membrane of the host strain 
carrying the hybrid construct are avoided. 
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EXAMPLE 2: Preparation of isolated DNA encoding GHPO 136, GHPO 191, 
GHPO 411, GHPO 419, GHPO 724, and GHPO 427, and production of these 
polypeptides as histidine-tagged fusion proteins 
2.A. Preparation of genomic DNA from Helicobacter pylori 

5 Helicobacter pylori strain ORV2001, stored in LB medium containing 50% 

glycerol at -70°C, is grown on Colombia agar containing 7% sheep blood for 48 hours 
under microaerophilic conditions (8-10% C0 2 , 5-7% 0 2 , 85-87% N 2 ). Cells are 
harvested, washed with phosphate buffer saline (PBS) (pH 7.2), and DNA is then 
extracted from the cells using the Rapid Prep Genomic DNA Isolation kit (Pharmacia 
10 Biotech). 

2.B. PCR amplification 

DNA molecules encoding the polypeptides of the invention are amplified from 
genomic DNA, as can be prepared as is described above, by the Polymerase Chain 
Reaction (PCR) using primers set forth in the attached table. The N-terminal and C- 

15 terminal primers for each clone both include a 5* clamp and a restriction enzyme 

recognition sequence for cloning purposes (BamHI (GGATCC) and Xhol (CTCGAG) 
recognition sequences). 

Amplification of gene-specific DNA is carried out using Vent DNA 
Polymerase (New England Biolabs) or Taq DNA polymerase (Appligene), according to 

2 0 the manufacturer's instructions. The reaction mixture, which is brought to a final volume 
of 100 jiL with distilled water, is as follows: 



dNTPs mix 200 /iM 

lOx ThermoPol buffer 10 /iL 

2 5 primers 300 nM each 

DNA template 50 ng 

Heat-stable DNA polymerase 2 units 
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Appropriate amplification reaction conditions can readily be determined by one skilled in 
the art. Specific examples of conditions are set forth in the accompanying table. 

5 2.C. Transformation and selection of transformants 

A single PCR product is thus amplified and is then digested at 37 °C for 2 
hours with BamHl and Xhol concurrently in a 20 juL reaction volume. The digested 
product is ligated to similarly cleaved pET28a (Novagen) that is dephosphorylated prior 
to the ligation by treatment with Calf Intestinal Alkaline Phosphatase (CIP). The gene 

1 0 fusion constructed in this manner allows one-step affinity purification of the resulting 
fusion protein because of the presence of histidine residues at the N-terminus of the 
fusion protein, which are encoded by the vector. 

The ligation reaction (20 //L) is carried out at 14 °C overnight and then is used 
to transform 100 /jL, fresh E. coli XLl-blue competent cells (Novagen). The cells are 

15 incubated on ice for 2 hours, heat-shocked at 42 °C for 30 seconds, and returned to ice for 
90 seconds. The samples are then added to 1 mL LB broth in the absence of selection and 
grown at 37°C for 2 hours. The cells are plated out on LB agar containing kanamycin (50 
jUg/mL) at a lOx and neat dilution and incubated overnight at 37 °C. The following day, 
50 colonies are picked onto secondary plates and incubated at 37 °C overnight. 

2 0 Five colonies are picked into 3 mL LB broth supplemented with kanamycin 

(100 yUg/mL) and are grown overnight at 37°C. Plasmid DNA is extracted using the 
Quiagen mini-prep, method and is quantitated by agarose gel electrophoresis. 

PCR is performed with the gene-specific primers under the conditions set 
forth above and transformant DNA is confirmed to contain the desired insert. If PCR- 

2 5 positive, one of the five plasmid DNA samples (500 ng) extracted from the E. coli XLl- 
blue cells is used to transform competent BL21 (ADE3) E. coli competent cells (Novagen; 
as described previously). Transformants (10) are picked onto selective kanamycin (50 
/xg/mL) containing LB agar plates and stored as a research stock in LB containing 50% 
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glycerol. 

2.D. Purification of recombinant proteins 

One mL of frozen glycerol stock prepared as described in 2.C. is used to inoculate 
50 mL of LB medium containing 25 /ig/mL of kanamycin in a 250 mL Erlenmeyer flask. 
5 The flask is incubated at 37°C for 2 hours or until the absorbance at 600 nm (ODgoo) 
reaches 0.4-1.0. The culture is stopped from growing by placing the flask at 4°C 
overnight. The following day, 10 mL of the overnight culture are used to inoculate 240 
mL LB medium containing kanamycin (25 /xg/mL), with the initial ODgQO about 0.02- 
0.04. Four flasks are inoculated for each ORF. The cells are grown to an ODgQO of 1.0 

10 (about 2 hours at 37°C), a 1 mL sample is harvested by centrifugation, and the sample is 
analyzed by SDS-PAGE to detect any leaky expression. The remaining culture is induced 
with 1 mM IPTG and the induced cultures are grown for an additional 2 hours at 37°C. 

The final ODgQO * s taken and the cells are harvested by centrifugation at 5,000 x g 
for 15 minutes at 4°C. The supernatant is discarded and the pellets are resuspended in 50 

1 5 mM Tris-HCl (pH 8.0), 2 mM EDTA. Two hundred and fifty mL of buffer are used for 1 
L of culture and the cells are recovered by centrifugation at 12,000 x g for 20 minutes. 
The supernatant is discarded and the pellets are stored at -45°C. 

2. E. Protein purification 

Pellets obtained from 2.D. are thawed and resuspended in 95 mL of 50 mM Tris- 
2 0 HC1 (pH 8.0). Pefabloc and lysozyme are added to final concentrations of 100 ^iM and 
100 /ig/mL, respectively. The mixture is homogenized with magnetic stirring at 5°C for 
30 minutes. Benzonase (Merck) is added at a 1 U/mL final concentration, in the presence 
of 10 mM MgCl2, to ensure total digestion of the DNA. The suspension is sonicated 
(Branson Sonifier 450) for 3 cycles of 2 minutes each at maximum output. The 
2 5 homogenate is centrifuged at 19,000 x g for 15 minutes and both the supernatant and the 
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pellet are analyzed by SDS-PAGE to detect the cellular location of the target protein in 
the soluble or insoluble fractions, as is described further below. 

2.E.I. Soluble fraction 

If the target protein is produced in a soluble form (i.e., in the supernatant obtained 
5 in 2.E.) NaCl and imidazole are added to the supernatant to final concentrations of 50 
mM Tris-HCl (pH 8.0), 0.5 M NaCl, and 10 mM imidazole (buffer A). The mixture is 
filtered through a 0.45 fim membrane and loaded onto an IMAC column (Pharmacia 
HiTrap chelating Sepharose; 1 mL), which has been charged with nickel ions according to 
the manufacturer's recommendations. After loading, the column is washed with 50 
1 0 column volumes of buffer A and the recombinant target protein is eluted with 5 mL of 
buffer B (50 mM Tris-HCl (pH 8.0), 0.5 M NaCl, 500 mM imidazole). 

The elution profile is monitored by measuring the absorbance of the fractions at 
280 nm. Fractions corresponding to the protein peak are pooled, dialyzed against PBS 
containing 0.5 M arginine, filtered through a 0.22 fim membrane, and stored at -45°C. 

1 5 2JE.2, Insoluble fraction 

If the target protein is expressed in the insoluble fraction (pellets obtained from 
2.E.), purification is conducted under denaturing conditions. NaCl, imidazole, and urea 
are added to the resuspended pellet to final concentrations of 50 mM Tris-HCl (pH 8.0), 
0.5 M NaCl, 10 mM imidazole, and 6 M urea (buffer C). After complete solubilization, 
2 0 the mixture is filtered through a 0.45 fim membrane and loaded onto an IMAC column. 

The purification procedures on the IMAC column are the same as described in 
2.E.L, except that 6 M urea is included in all buffers used and 10 column volumes of 
buffer C are used to wash the column after protein loading, instead of 50 column 
volumes. 

2 5 The protein fractions eluted from the IMAC column with buffer D (buffer C 

containing 500 mM imidazole) are pooled. Arginine is added to the solution to final 
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concentration of 0.5 M and the mixture is dialyzed against PBS containing 0.5 M arginine 
and various concentrations of urea (4 M,3M,2M, 1 M, and 0.5 M) to progressively 
decrease the concentration of ,urea. The final dialysate is filtered through a 0.22 /im 
membrane and stored at -45 °C. 
5 Alternatively, when the above purification process is not as efficient as it should 

be, two other processes may be used as follows. A first alternative involves the use of a 
mild denaturant, N-octyl glucoside (NOG). Briefly, a pellet obtained in 2.E. is 
homogenized in 5 mM imidazole, 500 mM sodium chloride, 20 mM Tris-HCl (pH 7.9) 
by microfluidization at a pressure of 15,000 psi and is clarified by centrifugation at 4,000- 

10 5,000 x g. The pellet is recovered, resuspended in 50 mM NaP0 4 (pH 7.5) containing 1-2 
% weight /volume NOG, and homogenized. The NOG-soluble impurities are removed by 
centrifugation. The pellet is extracted once more by repeating the preceding extraction 
step. The pellet is dissolved in 8 M urea, 50 mM Tris (pH 8.0). The urea-solubilized 
protein is diluted with an equal volume of 2 M arginine, 50 mM Tris (pH 8.0), and is 

1 5 dialyzed against 1 M arginine for 24-48 hours to remove the urea. The final dialysate is 
filtered through a 0.22 fim membrane and stored at -45°C. 

A second alternative involves the use of a strong denaturant, such as guanidine 
hydrochloride. Briefly, a pellet obtained in 2.E. is homogenized in 5 mM imidazole, 500 
mM sodium chloride, 20 mM Tris-HCl (pH 7.9) by microfluidization at a pressure of 

2 0 15,000 psi and clarified by centrifugation at 4,000-5,000 x g. The pellet is recovered, 
resuspended in 6 M guanidine hydrochloride, and passed through an IMAC column 
charged with Ni++. The bound antigen is eluted with 8 M urea (pH 8.5). Beta- 
mercaptoethanol is added to the eluted protein to a final concentration of 1 mM, then the 
eluted protein is passed through a Sephadex G-25 column equilibrated in 0. 1 M acetic 

2 5 acid. Protein eluted from the column is slowly added to 4 volumes of 50 mM phosphate 
buffer (pH 7.0). The protein remains in solution. 

2.F. Evaluation of the protective activity of the purified protein 
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Groups of 10 OF1 mice (IFFA Credo) are immunized rectally with 25 jug of the 
purified recombinant protein, admixed with 1 //g of cholera toxin (Berna) in 
physiological buffer. Mice are immunized on days 0, 7, 14, and 21. Fourteen days after 
the last immunization, the mice are challenged with H. pylori strain ORV2001 grown in 
5 liquid media (the cells are grown on agar plates, as described in 2.A., and, after harvest, 
the cells are resuspended in Brucella broth; the flasks are then incubated overnight at 
37 °C). Fourteen days after challenge, the mice are sacrificed and their stomachs are 
removed. The amount of H. pylori is determined by measuring the urease activity in the 
stomach and by culture. 

1 0 2.G. Production of monospecific polyclonal antibodies 
2.G.I. Hyperimmune rabbit antiserum 

New Zealand rabbits are injected both subcutaneously and intramuscularly with 
100 yug of a purified fusion polypeptide, as obtained in 2.E.I. or 2.E.2., in the presence of 
Freund's complete adjuvant and in a total volume of approximately 2 mL. Twenty one 
15 and 42 days after the initial injection, booster doses, which are identical to priming doses, 
except that Freund's incomplete adjuvant is used, are administered in the same way. 
Fifteen days after the last injection, animal serum is recovered, decomplemented, and 
filtered through a 0.45 jum membrane. 

2.G.2. Mouse hyperimmune ascites fluid 

2 0 Ten mice are injected subcutaneously with 10-50 jug of a purified fusion 

polypeptide as obtained in 2.E.I. or 2.E.2., in the presence of Freund's complete adjuvant 
and in a volume of approximately 200 juL. Seven and 14 days after the initial injection, 
booster doses, which are identical to the priming doses, except that Freund's incomplete 
adjuvant is used, are administered in the same way. Twenty one and 28 days after the 

2 5 initial infection, mice receive 50 jug of the antigen alone intraperitoneally. On day 21, 

mice are also injected intraperitoneally with sarcoma 180/TG cells CM26684 (Lennette et 
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al., Diagnostic Procedures for Viral, Rickettsial, and Chlamydial Infections, 5th Ed. 
Washington DC, American Public Health Association, 1979). Ascites fluid is collected 
10-13 days after the last injection. 

EXAMPLE 3: Methods for producing transcriptional fusions lacking His- tags 

5 Methods for amplification and cloning of DNA encoding the polypeptides of the 

invention as transcriptional fusions lacking His- tags are described as follows. Two PCR 
primers for each clone are designed based upon the sequences of the polynucleotides that 
encode them (see the attached sequence listing). These primers can be used to amplify 
DNA encoding the polypeptides of the invention from any Helicobacter pylori strain, 

10 including, for example, ORV2001 and the strain deposited as ATCC deposit number 
43579, as well as from other Helicobacter species. 

The N-terminal primers are designed to include the ribosome binding site of the 
target gene, the ATG start site, and any leader sequence and cleavage site. The N- 
terminal primers can include a 5' clamp and a restriction endonuclease recognition site, 

15 such as that for BamHl (GGATCC), which facilitates subsequent cloning. Similarly, the 
C-terminal primers can include a restriction endonuclease recognition site, such as that 
for Xhol (CTCGAG), which can be used in subsequent cloning, and a TAA stop codon. 

Amplification of genes encoding the polypeptides of the invention is carried out 
using Thermalase DNA Polymerase under the conditions described above in Example 2. 

2 0 Alternatively, Vent DNA polymerase (New England Biolabs), Pwo DNA polymerase 

(Boehringer Mannheim), or Taq DNA polymerase (Appligene) can be used, according to 
instructions provided by the manufacturers. 

A single PCR product for each clone is amplified and cloned into appropriately 
cleaved pET 24 (e.g., BamHl-Xhol cleaved pET 24), resulting in construction of a 

2 5 transcriptional fusion that permits expression of the proteins without His-tags. The 

expressed products can be purified as denatured proteins that are refolded by dialysis into 
1 M arginine. 




Cloning into pET 24 allows transcription of the genes from the T7 promoter, 
which is supplied by the vector, but relies upon binding of the RNA-specific DNA 
polymerase to the intrinsic ribosome binding sites of the genes, and thereby expression of 
the complete ORF. The amplification, digestion, and cloning protocols are as described 
5 above for constructing translational fusions. 

EXAMPLE 4: Purification of the polypeptides of the invention by immunoaffinity 
4.A. Purification of specific IgGs 

An immune serum, as prepared in section 2.G., is applied to a protein A 
Sepharose Fast Flow column (Pharmacia) equilibrated in 100 mM Tris-HCl (pH 8.0). 

10 The resin is washed by applying 10 column volumes of 100 mM Tris-HCl and 

10 volumes of 10 mM Tris-HCl (pH 8.0) to the column. IgG antibodies are eluted with 
0.1 M glycine buffer (pH 3.0) and are collected as 5 mL fractions to which is added 0.25 
mL 1 M Tris-HCl (pH 8.0). The optical density of the eluate is measured at 280 nm and 
the fractions containing the IgG antibodies are pooled, dialyzed against 50 mM Tris-HCl 

15 (pH 8.0), and, if necessary, stored frozen at 
-70°C. 

4.B. Preparation of the column 

An appropriate amount of CNBr-activated Sepharose 4B gel (1 g of dried gel 
provides for approximately 3.5 mL of hydrated gel; gel capacity is from 5 to 10 mg 
2 0 coupled IgG/mL of gel) manufactured by Pharmacia (17-0430-01) is suspended in 1 mM 
HC1 buffer and washed with a buchner by adding small quantities of 1 mM HC1 buffer. 
The total volume of buffer is 200 mL per gram of gel. 

Purified IgG antibodies are dialyzed for 4 hours at 20±5°C against 50 volumes of 
500 mM sodium phosphate buffer (pH 7.5). The antibodies are then diluted in 500 mM 
2 5 phosphate buffer (pH 7.5) to a final concentration of 3 mg/mL. 
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IgG antibodies are mixed with the gel overnight at 5±3°C. The gel is packed into 
a chromatography column and is washed with 2 column volumes of 500 mM phosphate 
buffer (pH 7.5), and 1 column volume of 50 mM sodium phosphate buffer, containing 
500 mM NaCl (pH 7.5). The gel is then transferred to a tube, mixed with 100 mM 
5 ethanolamine (pH 7.5) for 4 hours at room temperature, and washed twice with 2 column 
volumes of PBS. The gel is then stored in 1/10,000 PBS/merthiolate. The amount of IgG 
antibodies coupled to the gel is determined by measuring the optical density (OD) at 280 
nm of the IgG solution and the direct eluate, plus washings. 

4.C. Adsorption and elution of the antigen 

1 0 An antigen solution in 50 mM Tris-HCl (pH 8.0), 2 mM EDTA, for example, the 

supernatant obtained in 3.E. or the solubilized pellet obtained in 3.E., after centrifugation 
and filtration through a 0.45 //m membrane, is applied to a column equilibrated with 50 
mM Tris-HCl (pH 8.0), 2 mM EDTA, at a flow rate of about 10 mL/hour. The column is 
then washed with 20 volumes of 50 mM Tris-HCl (pH 8.0), 2 mM EDTA. Alternatively, 

1 5 adsorption can be achieved by mixing overnight at 5±3 °C. 

The adsorbed gel is washed with 2 to 6 volumes of 10 mM sodium phosphate 
buffer (pH 6.8) and the antigen is eluted with 100 mM glycine buffer (pH 2.5). The 
eluate is recovered in 3 mL fractions, to each of which is added 150 yuL of 1 M sodium 
phosphate buffer (pH 8.0). Absorption is measured at 280 nm for each fraction; those 

2 0 fractions containing the antigen are pooled and stored at 
-20°C. 

Other embodiments are within the following claims. 
What is claimed is: 
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